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A METHOD FOR IDENTIFYING A SYNTHETIC MOLECULE HAVING AFFINITY 
TOWARDS A TARGET • 

This application is a non-provisional of U.S. provisional application Serial No. 
60/483,899 filed on 2 July 2003, which is hereby incorporated by reference in its 
entirety, All patent and non-patent references cited in the application, or in the pres- 
ent application, are also hereby incorporated by reference in their entirety. 

Technical Field of the Invention 

The present invention relates to a method for identifying from a library a synthetic 
molecule having affinity towards a target The synthetic molecule has initially been a 
part of a complex also comprising an identifier that codes for said synthetic 
molecule. The invention further relates to a library of complexes, which in a certain 
embodiment can be used in the method. The experiments supporting the present 
method suggest an enrichment of more than a million times, e.g. a specific synthetic 
molecule may be identified in a library of 10 6 + complexes. 

Background 

Traditional drug discovery.begins with a pathological phenomenon in aaoroanfem 
and the development of a therapeutic theory to combat this. A chemical concept 
follows to produce compounds for screening. Most of the processes far curing the 
pathological phenomenon originate with the understanding of some biological path- 
ways and screening for an effect iatissues or cells. This may or may not eventually 
reveal a -target". The target can be identified by various conventional methods, in- 
• eluding protein expressing, protein chemistry, structure-functional studies, knowl- 
edge of biochemical pathways, and genetic studies. 

In recent years, genetic information has increasingly guided the identification of sin- 
gle molecular targets. These are derived from the knowledge of the genes of spe- 
cific cell phenotypes that encode proteins that may be involved in the pathogenesis 
of a particular disease state. 

A lead is a compound, usually a small organic molecule that demonstrates a desired 
biological activity on a target Usually, a collection of compounds, referred to as a 
"library . is screened before a useful lead is identified. Today, many libraries are 
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commercially available or open to public. Most pharmaceutical companies house 
their own compilation of compounds that have been synthesised over several years 
and screened against a variety of targets. 

Each compound in a library must be screened by an appropriate assay against the 
target Techniques for handling the screening of several thousands compounds si- 
multaneously have been developed and are generally referred to as high-throughput 
screening techniques. To push the limit of compounds possible to screen simultane- 
ously. different manufactures have been developing instrumentation capable of 
handling multiple micro titer plate formats on the same platform using 384 and 1536- 
well plates. Advances in small volume liquid dispensing and pipetting, reliable han- 
dl.ng of standardized plates and simplified assay formats all have made an impact 
on the reliability of the high-throughput screening process. 

However, high-throughput screening has the disadvantage that each of the .com- 
pounds have to be positioned in spatially discrete regions, usually in wells of a micro 
trter plate in order to observe an interaction with a target. If more than a single com- 
pound is present, it is not feasible to discern which compound displaying the appro- 
pnate biological activity. Thus, the full power of combinatorial chemistry cannot be 
applied because a collection of compounds usually is produced in a single con- 
tainer. 

To be able to select a possible lead compound in a collection of compounds placed 
in the same contained, libraries of Afunctional complexes have been evolved Each 
Afunctional complex in the library comprises a potential lead compound coupled to 
an identifier sequence. The identifier sequence is suitably a nucleic acid which 
uniquely identifies the potential lead compound. When a library of Afunctional com- 
plexes is screened against a target, one or more of the potential leads may bind to 
the target After removal of the remainder of the .bay, the binding Afunctional 
complexes can be eluated and the lead compound identified by sequencing the 
identifier. 

Various techniques for producing Afunctional complexes are known from the prior 
art. Some attempts to form the complex comprising a molecule as well as the identi- • 
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Omer attempts have focussed on the formation of encoded proteins Z^nT' 
Summary of the Invention 

The present invention concerns a mettiod for identifying a svnth^v , , 

a ) providing a library of bifunctional complexes wherein 

said molecule, "wwner. which codes for 

lestoaCf^ 

c ) removing the non-binding members of the library. 
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d) separating the identifiers of complexes comprising synthetic molecules 
having affinity towards the target, and • 

e) decoding the identifiers to establish the identity of the molecule. 

The present invention offers a novel method for enrichment of libraries of bifunc- 
tional complexes. In certain aspects, the method takes advantage of retaining the 
non-specific binding complexes bound to the target, walls of wells, beads etc while 
the identifiers of the complexes having specific affinity towards the target are sepa- 
rated. Thus, the present invention provides in certain aspects a method not only 
selecting such complexes which survives a selection assay, as the non-specific 
binding complexes are excluded from the pool of identifier separated. 

While it is not desired to be bound by any particular theory, it is presently believed 
that the identifier part of the complexes possesses an inherent affinity towards the 
surfaces of a reaction chamber or parts of the target not involved in the binding of an 
agonist or antagonist. The separation step ensures that the non-specific binding 
complexes are retained in the vessel. The identifiers attached to the synthetic mole- 
cule may be separated- by cleaving different kind of linkages. Preferably the linkage 
is selectively cleavable. i.e. when the media comprising the library of complexes is 
exposed to a certain condition, only the Intended linkage is cleaved. 

In one embodiment of the invention, a cleavable linker moiety is situated between 
the synthetic molecule and the identifier. When the complex is subjected to condi- 
tions cleaving the linker, only such identifiers that are not adhered to solid surfaces 
etc. are liberated into the liquid media. The identifiers in the liquid media may easily 
be recovered, thereby separating these from the non-specific binding, identifiers re- 
maining immobilized. In another embodiment of the invention, a cleavable linker is 
positioned between a target and a solid surface, thereby immobilizing the target 
Subsequent to exposing the immobilized target to a library of complexes and re- 
moving the non-binding part of the library, the complexes attached to the target may 
separated by exposing the media to a condition cleaving the linkage between the 
target and the solid surface. If desirable the identifier can be isolated and amplified 
using standard molecular methods, such as PCR. 
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The second embodiment may be combined with chromatography. Following the 
separation of the complexes attached to the target, the media may be subjected to 
chromatography, e.g. size exclusion chromatography. In certain embodiments after 
the chromatography process another orthogonal cleavable linkage between the 
i synthetic molecule and the identifier may be cleaved to liberate the identifier. 

The invention also relates to a library of complexes, in which each different complex 
comprises a synthetic molecule, attached via a cleavable linkage to an identifier 
which codes for said molecule. The linkage may be cleaved by a variety of different 
conditions, such as electromagnetic radiation, chemical agents, and enzymes. In a 
preferred aspect the selective cleavable linkage comprises a chemical moiety, which 
can be cleaved by electromagnetic irradiation, such as light having a specified wave 
length. A useful chemical moiety for the cleavable linkage comprises a group 

NO2 R 2 in which R 1 and R 2 are either of the synthetic molecule or the 
5 identifier, respectively, R 3 is H or OCH 3 . and X is O. S, or NH. The identifiers pref- 
erably comprise a sequence of nucleotides. In one aspect of the invention, the iden- 
tifier comprises 2 or more codons. which codes for 2 or more chemical entities in- 
corporated into the synthetic molecule. 

) Complex 

The complex of the present invention comprises a synthetic molecule and an identi- 
fier. The identifier comprises identifying moieties that identifies the molecule. Pref- 
erably, the identifier identifies the molecule uniquely, i.e. in a library of complexes a 
particular identifier is capable of distinguishing the molecule it is attached to from the 
rest of the molecules. 

Tne molecule and the identifier may be attached directly to each other or through a 
bridging moiety. In one aspect of the invention, the bridging moiety is a selectively 
cleavable linkage. The identifying moieties of each complex suitably comprise rec- 
ognition units. i.e. units which may be recognized by a detecting entity. A variety of 
different kinds of recognition exist in nature. Examples include antibodies which rec- 
ognize an epitope, proteins which recognize another protein. mRNA which recog- 
nize a protein, small molecules (like biotin) which recognize a protein (like avidine or 
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streptavidine) and oligonucleotides which recognizes complementing oligonucleo- 
tides. Generally it is preferred that the identifier is a sequence of nucleotides. 

The method may in certain embodiments be performed without amplification after 
5 the separation step. However, when larger libraries are used and the amount of 
separated identifiers Is relatively low, it is in general preferred to use an identifier 
which is amplifiable. Identifiers comprising a sequence of nucleotides may be ampli- 
fied using standard techniques, like PCR. In the event the identifier is a protein, the 
protein may be amplified by attaching the mRNA which encoded the synthesis 
1 a thereof, generating the cDNA from the mRNA and subjecting said mRNA to a 

translation system. ( Such system is described in WO 98/31700 the content of which 
is incorporated herein by reference. An alternative method for amplifying a protein is 
to use phage-displayed proteins. 

15 The identifier may comprise two or more codons. The sequence of codons can be 
decoded to identify reactants usedin the formation of the molecule. When the iden- 
tifier comprises more than one codon, each member of a pool of building blocks can 
be identified uniquely and the order of codons is informative.of the synthesis step 
each member has been incorporated in. 
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The sequence of the nucleotides in each codon may have any suitable length. The 
codon may be a single nucleotide or a plurality of nucleotides. In some aspects of 
the invention, it is preferred that each codon independently comprises four or more 
nucleotides, more preferred 4 to 30 nucleotides. 



The identifier will in general have at least two codons arranged in sequence, i.e. 
next to each other. Two neighbouring codons may be separated by a framing se- 
quence. Depending on the encoded molecule formed, the identifier may comprise 
further codons, such as 3, 4, 5, or more codons. Each of the further codons may be 
30 separated by a suitable framing sequence. Preferably, all or at least a majority of the 
codons of the identifier are separated from a neighbouring codon by a framing se- 
quence. The framing sequence may have any suitable number of nucleotides, e.g. 1 
to 20. AJternatively, codons on the identifier may be designed with overlapping se- 
quences. 

35 
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The framing sequence, if present, may serve various purposes. In one setup of the 
invention, the framing sequence idenfifies.the position of the codon. Usually the 
framing sequence either upstream or downstream of a codon comprises information 
which allows determination of the position of the codons. In another setup the 
> frames have alternating sequences, allowing foraddition of building btocks from two 
pools in the formation of me library, me framing sequence may also or in addition 
provide for a region of high affinity. The high affinity region may ensure that the hy- 
bnd,sation of the template with an anB-codon will occur in frame. Moreover the 
framing sequence may adjust the annealing temperature to a desired level." 

A framing sequence with high affinity can be provided by incorporation of one or 
more nucleobases forming three hydrogen bonds to a cognate nucleobase Exam- 
ples of nucleobases having this property are guanine and cytosine. Alternatively or 
•n addrt.on. the framing sequence may be subjected to backbone modification Sev 
eral back bone modifications provides for higher affinity, such as 2--0-methyl sub 
aftutbn of the ribose moiety, peptide nucleic acids (PNA). and ZA' ^methylene 
cycl.sation of the ribose moiety, also referred to as LNA (Locked Nucleic Acid). 

The identifier may comprise flanking regions around me codons. The flanking region 
can encompass a signal group, such as a fiourophor or a radio active group to allow 
for detection of the presence or absence of a complex or the flanking region may 
00^ a label that may be detected, such as biotin. When the identifier compnses 
a biofan moiety, the identifier may easily be recovered following the separation step. 

The flanking regions can also serve as priming sites for amplification reactions such 
as PGR. The identifier may in certain embodiments comprise an affinity region hav- 
ing the property of being able to hybridise to a building block. 

It is to be understood mat when the term identifier is used in the present description 
and daims. the identifier may be in the sense or the anti-sense format, i.e the iden- 
tifier can be a sequence of codons which actually codes for the molecule or can be a 
sequence complementer thereto. Moreover, the identifier may be single-stranded 
or double-stranded, as appropriate. 
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The synthetic molecule part of the complex is generally of a structure expected of 
having an effect on the target When the target is of pharmaceutical importance, the 
molecule is generally a possible drug candidate. The complex may be formed by 
tagging a library of different possible drug candidates with a tag, e.g. a nucleic acid 
i tag uniquely identifying each possible drug candidate. In another embodiment of the 
invention, the molecule is encoded; i.e. formed by a variety of reactants which have 
reacted with each other and/or a scaffold molecule. Optionally, this reaction product 
may be post-modified to obtain the final molecule displayed on the complex. The 
pc>st-modification may involve the cleavage of one or more chemical bonds attach- 
ing the encoded molecule to the indentifier in order more efficiently to display the 
encoded molecule. 



15 



20 



The formation of an encoded molecule generally starts by a scaffold, Le. a chemical 
unit haying one or more reactive groups capable of forming a connection to another 
reactive group positioned on a chemical entity, thereby generating an addition to the 
original scaffold. A second chemical entity may react with a reactive group also 
appearing on the original scaffold or a reactive group incorporated by the first 
chemical entity. Further chemical entities may be involved in the formation of the 
final reaction product. The formation of a connection between the chemical entity 
and the nascent encoded molecule may be mediated by a bridging molecule. As an 
example, if the nascent encoded molecule and the chemical entity both comprise an 
amine group a connection between these can be mediated by a dicarboxylic acid. A 
synthetic molecule is in general produced in vitro and may be a naturally occurring 
or an artificial substance. Usually, a synthetic molecule is not produced using the 
25 naturally translation system in an in vitro process. 

The chemical entities that are precursors for structural additions or eliminations of 
the encoded molecule may be attached to a building block prior to the participation 
in the formation of the reaction product leading the final encoded molecule. Besides 
the chemical entity, the building block generally comprises an anti-codon. In some 
embodiments the building blocks also comprise an affinity region providing for 
affinity towards the nascent complex 

Thus, the chemical entities are suitably mediated to the nascent encoded molecule 
35 by a building block, which further comprises an anticodon. The anti-codon serves 
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the function of transferring the genetic information of the building block in 
conjunction with the transfer of a chemical entity. The transfer of genetic information 
and chemical entity may occur in any order, however, it is important that a 
correspondence is maintained in the complex The chemical entities are preferably 
reacted without enzymatic interaction in some aspects of the invention. Notably, the 
reaction of the chemical entities is preferably not mediated by ribosomes or 
enzymes having similar activity. In another aspect of the invention a ribosome is 
used to translate an mRNA into a protein using a tRNA loaded with a natural or 
unnatural amino acid. 

According to certain aspects of the invention the genetic information of the anti- 
codon is transferred by specific hybridisation to a codon on a nucleic acid template. 
Another method for transferring the genetic information of the anti-codon to the 
nascent complex is to anneal an oligonucleotide complementary to the anti-codon 
and attach this oligonucleotide to the complex, e.g. by ligation. A still further method 
involves transferring the genetic information of the anti-codon to the nascent 
complex by an extension reaction using a polymerase and a mixture of dNTPs. 

The chemical entity of the building block may in most cases be regarded as a pre- 
cursor for the structural entity eventually incorporated into the encoded molecule. In 
other cases the chemical entity provides for the eliminations of chemical units of the 
nascent encoded molecule. Therefore, when it in the present application with claims 
is stated that a chemical entity is transferred to a nascent encoded molecule it is to 
be understood that not necessarily all the atoms of the original chemical entity is to 
be found in the eventually formed encoded molecule. Also, as a consequence of the 
reactions involved in the connection, the structure of the chemical entity can be 
changed when it appears on the nascent encoded molecule. Especially, the cleav- 
age resulting in the release of the entity may generate a reactive group which in a 
subsequent step can participate in the fomiation of a connection between a nascent 
complex and a chemical entity. 

The chemical entity of the building block comprises at.least one reactive group ca- 
pable of participating in a reaction which results in a connection between the chemi- 
cal entity of the building block and another chemical entity or a scaffold associated 
with the nascent complex. The number of reactive groups which appear on the 



WO 2005/003778 PCT/DK2 004/000466 

10 

chemical entity is suitably one to ten. A building block featuring only one reactive 
group is used i.a. in the end positions of polymers or scaffolds, whereas building 
blocks having two reactive groups are suitable for the formation of the body part of 
polymer or scaffolds capable of being reacted further. One, two or more reactive 
5 groups intended for the formation of connections, are typically present on scaffolds. 
Non-J/rnffing examples of scaffolds are opiates, steroids, benzodiazepines, hydan- 
toines, and peptidylphosphonates. 

The reactive group of the chemical entity may be capable of forming a direct con- 
) nection to a reactive group of the nascent complex or the reactive group of the 
building block may be capable of forming a connection to a reactive group of the 
nascent complex through a bridging fill-in group. It is to be understood that not all 
the atoms of a reactive group are necessarily maintained in the connect/on formed. 
Rather, the reactive groups are to be regarded as precursors for the stmcture of the 
connection. 

The subsequent cleavage step to release the chemical entity from the building block 
can be performed in any appropriate way. In an aspect of the inventibn the cleavage 
involves usage of a chemical reagent or an enzyme. The cleavage results in a 
transfer of the chemical entity to the nascent encoded molecule or in a transfer of 
the nascent encoded molecule to the chemical entity of the building block. In some 
cases it may be advantageous to introduce new chemical groups as a consequence 
of linker cleavage. The new chemical groups may be used for further reaction in a 
subsequent cycle, either directly or after having been activated. In other cases it is 
desirable that no trace of the linker remains after the cleavage. 

In another aspect, the connection and the cleavage is conducted as a simultaneous 
reaction, i.e. either the chemical entity of the building block or the nascent encoded 
molecule is a leaving group of the reaction. In some aspects of the invention, it is 
appropriate to design the system such that the connection and the cleavage occur 
simultaneously because this will reduce the number of steps and the complexity. 
The simultaneous connection and cleavage can also he designed such that ejther 
no trace of the linker remains or such that a new chemical group for further reaction 
is introduced, as described above. 
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The attachment of the chemical entity to the building block, optionally via a suitable 
spacer can be at any entity avaflable for attachment, e.g. the chemical entity can be 
attached to a nucleobase or the backbone. In general, it is preferred to attach the 
chemical entity at the phosphor of the intemucleoside linkage or at the nucleobase 
5 When the nucleobase is used for attachment of the chemical entity, the attachment 

pointisusuallyat^eZpositionofthepurinesorr^Jeaza-purinsorattheSposition 
of pynmidines. The nucleotide may be distanced from the reactive group of the 
chemical entity by a spacer moiety. The spacer may be designed such that the 
conformational spaced sampled by the reactive group is optimized for a reaction 
) with the reactive group of the nascent encoded molecule. 

The synthetic molecules of the invention may have any chemical structure. In a 

preferred aspect, the synthetic molecule can be any compound that may be 
synthesized in a component-by-component fashion. In some aspects the synthetic 
molecule is a linear or branched polymer. In another aspect the synthetic molecule 
■s a scaffolded molecule. The term "synthetic molecule" also comprises naturally 
occurring molecules like a-polypeptides etc. however produced in vitro usually in the 
absence of enzymes, like ribosomes. In certain aspects, the synthetic molecule of 
the library is a non-a-polypeptide. 

The synthetic molecule may have any molecular weight. However, in order to be 
orally available, it is in this case preferred that the synthetic molecule has a 
molecular weight less than 2000 Daltons. preferably less than 1000 Dalton. and 
more preferred less than 500 Daltons. 

The size of the library may vary considerably pending on the expected result of the 
inventive method. In some aspects, it may be sufficient that the library comprises 
two, three, or four different complexes. However, in most events, more than two 
Afferent complexes are desired to obtain a higher diversity. In some aspects the 
library comprises 1.000 or more different complexes, more preferred 1.000.000 or 
more different complexes. The upper limit for the size of the library is only restricted 
by the size of the vessel in which the library is comprised. It may be calculated that a 
vial may comprise up to. 10" different complexes. 
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Methods for forming libraries of complexes 

The complexes comprising an identifier having two or more codons that codes for 
reactants that have reacted in the formation of the molecule part of the complex may 
be formed by a variety of processes. Generally, the preferred methods can be used 
for the formation of virtually any kind of encode molecule. Suitable.examples of pro- 
cesses include prior art methods disclosed in WO 93/20242, WO 93/06121, WO 
00/23458. WO 02/074929, and WO 02/103008, the content of which being incorpo- 
rated herein by reference as well as methods of the present applicant not yet public 
available, including the methods disclosed in DK PA 2002 01955 filed 19 December 
2002, and DK PA 2003 00430 filed 20 March 2003. Any of these methods may be 
used, and the entire content of the patent applications are included herein by refer- 
ence. 

Below four preferred embodiments are described. A first embodiment disclosed in 
more detail in WO 02/103008 is based on the use of a polymerase to incorporate 
unnatural nucleotides as building blocks. Initially, a plurality of template 
oligonucleotides is provided. Subsequently primers are annealed to each of the 
templates and a polymerase is extending the primer using nucleotide derivatives 
which have appended chemical entities. Subsequent to or simultaneously with the 
incorporation of the nucleotide derivatives, the chemical entities are reacted to form 
a reaction product The encoded molecule may be post-modified by cleaving some 
of the linking moieties to better present the encoded molecule. 

Several possible reaction approaches for the chemical entities are apparent. First, 
the nucleotide derivatives can be incorporated and the chemical entities 
subsequently polymerised. In the event the chemical entities each carry two reactive 
groups, the chemical entities can be attached to adjacent chemical entities by a 
reaction of these reactive groups. Exemplary of the reactive groups are amine and 
carboxylic acid, which upon reaction form an amide bond. Adjacent chemical entities 
can also be linked together using a linking or bridging moiety. Exemplary of this 
approach is the linking of two chemical entities each bearing an amine group by a 
bi-carboxylic acid. Yet another approach is the use of a reactive group between a 
chemical entity and the nucleotide building block, such as an ester or a thioester 
group. An adjacent building block having a reactive group such as an amine may 
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cleave the interspaced reactive group to obtain a linkage to the chemical entity e a 
by an amide linking group. ' 

A second embodiment for obtainment of complexes pertains to the use of 
5 hybridisation of building blocks to a template and reaction of chemical entities 

attached to the building blocks in order to obtain a reaction product This approach 
compnses that templates are contacted with a plurality of building blocks wherein 
each bunding block comprises an ant«odon and a chemical entity. The anti-codons 
are designed such that they recognise a sequence, i.e: a codon. on the template 
Subsequent to the annealing of the anti-codon and the codon to each other a 
reaction of the chemical entity is effected. 

The template may be associated with a scaffold. Building blocks bringing chemical 
entities ,n may be added sequentially or simultaneously and a reaction of the 
reactive group of the chemica, entity may be effected at any time after the anneaiing 
of the building blocks to the template. 

A third embodiment for the generation of a complex includes chemical or 
enzymatical ligation of building blocks when these are lined up on a template 
Initially, temp.ates are provided, each having one or more codons. The tempia.es 
are contacted with building blocks comprising anti-codons linked to chemical 
entities. The two or more anti-codons anneaied on a template are subsequently 
■-gated to each otherand a reaction of the chemical entities is effected to obtain a 
reaction product The method is disclosed in more detail in DK PA 2003 00430 filed 
20 March 2003. 

A fourth embodiment makes use of the extension byapolymerase of an affinity 
sequence of the nascent complex to transfer the anti-codon of a building block to the 
nascent complex. The method implies that a nascent compiex comprising a scaffold 
and an affinity region is annealed to a building block comprising a region 
complementary to the affinity section. Subsequently the anti-oodon region of the 
balding block is transferred to the nascent complex by a polymerase. The transfer 
of the chemicai entity may be transferred prior to. simultaneously with or subsequent 
to the transfer of the anti-codon. This method is disclosed in detail in DK PA 2002 
01955 filed 19 December 2002. 
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Thus, the codons are either pre-made into one or more templates before the 
encoded molecules are generated or the codons are transferred simultaneously with 
the formation of the encoded molecules. 

5 

After or simultaneously with the formation ofthe reaction product some of the linkers 

to the template may be cleaved, however at least one linker must be maintained to 
provide for the complex. 

10 Nucleotides 

The nucleotides used in the present invention, may be linked together in a sequence 
of nucleotides, i.e. an oligonucleotide. Each nucleotide monomer is normally com- 
posed of two parts, namely a nucleobase moiety, and a backbone. The back bone 
may ,n some cases be subdivided into a sugar moiety and an intemucleoside linker. 

The nucleobase moiety may be selected among naturally occurring nucleobases as 
well as non-naturally occurring nucleobases. Thus, "nucleobase" includes notonly 
the known purine and pyrimidine hetero-cydes. but also heterocyclic analogues and 
tautomers thereof. I.lustrafive examples of nucleobases are adenine, guanine thy- ' 
mme. cytosine, uracil, purine, xanthine, diaminopurine. 8^xo-N 8 -methyladenioe 7- 
deazaxanthine, 7-deazaguanine. N\N'-ethanocytosin. ^.N'-ethano-2,6-diamino- 
purine. 5-methylcytosine. S-^-C^-alkynyicytosine. 5-fluorouradl, 5-bromouracil ' 
pseudoisocytosine. a-hydroxy-S-methyl^-triazolopyridine. isocytosine. isoguanine 
mosme and the "non-naturally occurring" nucleobases described in Benneret al ' 
U.S. Pat No. 5.432.272. The term "nucleobase" is intended to cover these examples 
as well as analogues and tautomers thereof. Especially interesting nucleobases are 
adenine, guanine, thymine, cytosine. 5-memylcytosine. and uracil, which are con- 
sidered as the naturally occurring nucleobases in relation to therapeutic and diag- 
nostic application in humans. 

Examples of suitable specific pairs of nucleobases are shown below: 
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• Natural Base Pairs 

R=rt UracS ' 

ffc'CHj: Thymine Cytasine 



& ° OCT 



Guanine 



Synthetic Base Pairs 



Backbone 



/=\ Bactton. N^s\ 

Bacfctafw 

Synthsttc purine bases pairrlng with natural pyrlmldlrwe 

R=H: Uracil 

R=CH> Thymine Cytosine 



ibOM 

7-deaza adenine 



examples of backbone units are shown below (B denotes a nucleobase): 
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■<w, 



S N A bH 

0-£O" 0=^-0- o^-o- R o^-O- 

DNA Oxy-LNA Thio-LNA Amino-LNA 

of* o-p- o-^-o- 0=^,. ^ 

Pho.phonhioatt r-0-M«|,y| T-MOE 2--F-ANA 

2*"AP HNA CcNA PNA ? \ 





is 9 



Morphalino 



-BHV 



2X3-hydmxy)p«>py > S'-Phosphonunidate 



TNA 



The sugar moiety of the backbone is suitably a pentose but may be the appropriate 
part of an PNA or a six-member ring. Suitable examples of possible pentoses in- 
clude ribose, 2'-deoxyribose, 2'-0-methyi-ribose, 2'-flour-ribose 1 and 2 , -4'-0- 
methylene-ribose (LNA). Suitably the nucleobase is attached to the 1' position of the 
pentose entity. 

An internucleoside linker connects the 3' end of preceding monomer to a 5* end of a 
succeeding monomer when the sugar moiety of the backbone is a pentose, like ri- 
bose or 2-deoxyribose. The internucleoside linkage may be the natural occurring 
phospodiester linkage or a derivative thereof. Examples of such derivatives include 
phosphorothioate, methylphosphonate, phosphoramidate. phosphotriester, and 
pbosphodithioate. Furthermore, the internucleoside linker can be any of a number of 
non-phosphorous-containing linkers known in the art. 
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Preferred nucleic acid monomers include naturally occurring nucleosides forming 
part of the DNA as Well as the RNA family connected through phosphodiester link- 
ages. The members of the DNA family include deoxyadenosine, deoxyguanosine. 
5 deoxythymidine. and deoxycytidine. Tne members of the RNA family include adeno- 
sine, guanosine, uridine, cytidine, and inosine. Inosine Is a non-specific pairing nu- 
cleoside and may be used as universal base because inosine can pair nearly iso- 
energetically with A. T, and C. Other compounds having the same ability of non- 
specifically base-pairing with natural nucledbases have been formed. Suitable com- 
1 0 pounds which may be utilized in the present invention includes among others the 
compounds depicted below 
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Building block 

The chemical entities that are precursors for structural additions or eliminations of 
the encoded molecule may be attached to a building block prior to the participation 
5 in the formation of the reaction product leading the final encoded molecule: Besides 
. the chemical entity, the building block generally comprises an anti-codon. 

The chemical entity of the building block comprises at least one reactive group ca- 
. pable of participating in a reaction which results in a connection between the chemi- 
10 cal entity of the building block and another chemical entity or a scaffold associated 
with the nascent complex. The connection is facilitated by one or more reactive 
groups of the chemical entity. The number of reactive groups which appear on the 
chemical entity is suitably one to ten. A building block featuring only one reactive 
group is used La. in the end positions of polymers or scaffolds, whereas building 
blocks having two reactive groups are suitable for the formation of the body part of a 
polymer or scaffolds capable of being reacted further. One, two or more reactive 
groups intended for the formation of connections, are typically present on scaffolds. 

, The reactive group of the building block may be capable of forming a direct connec- 
20 tion to a reactive group of the nascent complex or the reactive group of the building 
block may be capable of forming a connection to a reactive group of the nascent 
complex through a bridging fill-in group. It is to be understood that not ail the atoms 
of a reactive group are necessarily maintained in the connection formed. Rather, the 
reactive groups are to be regarded as precursors for the structure of the connection 

5 

The subsequent cleavage step to release the chemical entity from the building block 
can be performed in any appropriate way. In an aspect of the invention the cleavage 
involves usage of a reagent or an enzyme. The cleavage results in a transfer of the 
chemical entity to the nascent encoded molecule or in a transfer of the nascent en- 
coded molecule to the chemical entity of the building block. In some cases it may be 
advantageous to introduce new chemical groups as a consequence of linker cleav- 
age. The new chemical groups may be used for further reaction in a subsequent 
cycle, either directly or after having been activated. In other cases it is desirable that 
no trace of the linker remains after the cleavage. ■ 
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In another aspect, the connection and the cleavage is conducted as a simultaneous 
reaction, i.e. either the chemical entity of the building block or the nascent encoded 
molecule is a leaving group of the reaction .In general.it is preferred to design the 
system such that the connection and the cleavage occur simultaneously because 
5 this will reduce the number of steps and the complexity. The simultaneous connec- 
tion and cleavage can also be designed such that either no trace of the linker re- 
mains or such that a new chemical group for further reaction is introduced, as de- 
scribed above. 

10 The attachment of the chemical entity to the building block, optionally via a suitable 
spacer can be at any entity available for attachment, e.g. the chemical entity can be 
attached to a nucleobase or the backbone. In general, it is preferred to attach the 
chemical entity at the phosphor of the intemucleoside linkage or at the nucleobase. 
When the nucleobase is used for attachment of the chemical entity, the attachment 
15 point is usually at the 7 position of the purines or 7-deaza-purins or at the 5 position 
of pyrimidines. The nucleotide may be distanced from the reactive group of the 
chemical entity by a spacer moiety. The spacer may be designed such that the 
conformational space sampled by the reactive group is optimized for a reaction with 
the reactive group of the nascent encoded molecule or reactive site. 
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The anticodon complements the codon of the identifier sequence and generally 
comprises the same number of nucleotides as the codon. The anticodon may be 
adjoined with a fixed sequence, such as a sequence complementing a framing se- 
quence. 

Various specific building blocks are envisaged. Building blocks of particular interest 
are shown below. 

Building blocks transferring a chemical entity to a recipient nndeophilic nm„ p 
The building block indicated below is capable of transferring a chemical entity (CE) 
to a recipient nucleophilic group, typically an amine group. The bold lower-horizontal 
line illustrates the building block and the vertical line illustrates a spacer. The 5- 
membered substituted N-hydroxysucdnimid (NHS) ring serves as an activator, i.e. a 
labile bond is formed between the oxygen atom connected to the NHS ring and the 
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chemical entity. The labile bond may be cleaved by a nucleophilic group, e.g. posi- 
tioned on a scaffold 




The 5-membered substituted N-hydroxysucdnimid (NHS) ring serves as an activa- 
tor, i.e. a labile bond is formed between the oxygen atom connected to the NHS ring 
and the chemical entity. The labile bond may be cleaved by a nucleophilic group, 
e.g. positioned on a scaffold, to transfer the chemical entity to the scaffold, thus 
converting the remainder of the fragment into a leaving group of the reaction. 
When the chemical entity is connected to the activator through an carbonyl group 
and the recipient group is an amine, the bond formed on the scaffold will an amide 
bond. The above building block is the subject of the Danish patent application No. 
PA 2002 01946 and the US provisional patent application No. 60/434,439, the con- 
tent of which are incorporated herein in their entirety by reference. 

Another building block which may form an amide bond Is ' 




R may be absent or N0 2 , CF3. halogen, preferably CI, Br, or I, and Z may be S or O. 
This type of building block is disclosed in Danish patent application No. PA 2002 
0951 and US provisional patent application filed 20 December 2002 with the title "A 
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building block capable of transferring a functional entity to a recipient reactive 
. group". The content of both patent application are incorporated herein in their en- 
tirety by reference. 

A nucleophilic group can cleave the linkage between Z and the carbonyi group 
thereby transferring the chemical entity -^OJ-CE' to said nucleophilic group. 

Building blocks tra nsferring a chemical entity to a recipient reactive group forming a 
OCbond 

A building block as shown below are able to transfer the chemical entity to a recipi- 
ent aldehyide group thereby forming a double bond between the carbon of the alde- 
hyde and the chemical entity 



15 




The above building block is comprised by the Danish patent application No. DK PA 
2002 01952 and the US provisional patent application filed 20 December 2002 with 
the title "A building block capable of transferring a functional entity to a recipient re- 
active group forming a C=C double bond". The content of both patent applications 
are incorporated herein in their entirety by reference. 

Building blocks transferri ng a chemical entity to a recipient reactive group forming * 
C-C bond 

The below building block is able to transfer the chemical entity to a recipient group 
thereby forming a single bond between the receiving moiety, e.g. a scaffold, and the 
chemical entity. ^ 
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The above building block is comprised by the Danish patent application No. DK PA 
2002 01947 and the US provisional patent application No 60/434,428. The content 
of both patent applications are incorporated herein in their entirety by reference. 

« : - 
Another building block capable of transferring a chemical entity to a receiving reac- 
tive group forming a single bond is 



-O — -CE 



The receiving group may be a nucleophile, such as a group comprising a hetero 
atom, thereby forming a single bond between the chemical entity and the hetero 
atom, or the receiving group may b<* an electronegative carbon atom, thereby form- 
ing a C-C bond between the chemical entity and the scaffold. 

The chemical entity attached to any of the above building blocks may be a selected 
from a large arsenal of chemical structures. Examples of chemical entities are 
H or entities selected among the group consisting of a C r C 6 alkyl, Cr-Ca alkenyl, 
C2-C fl alkynyi, C 4 -C 8 alkadienyl, C r C 7 cydoalkyl, C r C 7 cycloheteroalkyl, aryl, and 
heteroaryl, said group being substituted with 0-3 R 4 , 0-3 R 5 and 0-3 R 9 or d-C 3 al- 
kylene-NR 4 2l C r C 3 afkylene-NR 4 C(0)R 8 , C t -C 3 alkylene-NR 4 C(0)OR 8 , d-Cj al- 
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kyleneO-NR 4 2l C,-C 2 alkylene-0-NR 4 C(0)R a , d-C, alkylene-0-NR 4 C<0)OR a S ub- 
stittrted with 0-3 R s . 

where R 4 is H or selected independently among the group consisting of 
d-Ce alkyf, C r C 6 alkenyl, Cz-Qj alkynyl, C r C 7 cydoa/kyl, C3-C7 cycloheteroalkyl, 
5 aryl. heteroaryi, said group being substituted with 0-3 R 8 and 

R 5 is selected independently from -N 3 , -CNO, -C(NOH)NH 2 , -NHOH, 
-NHNHR 6 , -C(0)R 6 , -SnR a 3 , -B(OR 6 ) 2 , -P(0)(OR 6 ) 2 or the group consisting of CVQ, 
alkenyl. C^Ce alkynyl, C 4 -C 8 alkadienyl said group being substituted with 0-2 R 7 , 

where R 8 is selected independently from H, C.-Qj aikyl, C±C 7 cycloal- 

0 ' kyl, aryl or d-Ce alkylene-aryl substituted with 0-5 halogen atoms selected from -F, 
-CI, -Br. and -I; and.R 7 is independently selected from -N0 2 , -COOR 6 , -COR 6 , -CN, 
-OSiR e 3 . -OR 8 and-NR 6 2 . 

R a is H, CrCe alkyl, CrCe alkenyl, Cj-Ce alkynyl, CrC 7 cycloalkyl, aryl or C r Ce 
alkylene-aryl substituted with 0-3 substituents independently selected from -F, -CI, - 

1 N0 2l -R 3 , -OR 3 , -SiR 3 3 

R 9 is =0, -F, -CI. -Br, -1. -CN. -N0 2 , -OR 6 . -NR 6 2 , -NR 8 -C(0)R 8 . -NR 8 -C(0)OR 8 , -SR fl , 
-S(0)R 8 , -S(0) 2 R 6 , -COOR 8 , -qO)NR 6 2 and -S(0) 2 NR s 2 . 

Cross-link cleavage building blocks 

It may be advantageous to split the transfer of a chemical entity to a recipient reac- 
tive group into two separate steps, namely a cross-linking step and a cleavage step 
because each step can be optimized. A suitable building block for this two step pro- 
cess is illustrated below: 




'FEP 



A 



Initally, a reactive group appearing on the functional entity precursor (abbreviated 
FEP) reacts with a recipient reactive group, e.g. a reactive group appearing on a 



WO 2005/003778 PCT/DK2004/000466 

25 

scaffold, thereby forming a cross-link. Subsequently, a deavage is performed, usu- 
. ally by adding an aqueous oxidising agent such as l 2 , Br 2f Cl 2( H* or a Lewis add. 
The cleavage results in a transfer of the group HZ-FEP- to the recipient moiety, such 
as a scaffold. 

,5 

In the above formula 

ZisO.S.NR 4 
QisN.CR 1 

P is a valence bond. 0, S, NR 4 , or a group C^arylene, C^alkylene 
10 C^O-alkylene, d^S-alkytene, NR'-afkylene, C^alkylene-O, d^afkyiene-S option 
said group being substituted with 0-3 R 4 , 0-3 R 5 and 0-3 R 9 or d-d alky1ene-NR 4 2 , 
d-d alky1ene-NR 4 C(0)R 8 , d-d a!kyiene-NR 4 C(0)OR a , C r C 2 alkyfene-0-NR 4 2 , ' 
d-d alkylene-0-NR 4 C(0)R a , d-d alkylene-0-NR 4 C(0)OR 8 substituted with 0-3 
R 9 , 

15 B is a group comprising D-E-F, in which 

D is a valence bond or a group C^alkylene, d^alkenylene, d- 
ealkynylene, C^arylene, or d-rheteroarylene, said group optionally being substi- 
tuted with 1 to 4 group R 11 , 

E is, when present, a valence bond, O, S, NR 4 , oragroup C*. 
ealkylene, C^alkenylene, C^alkynylene, d-rarylene. or C^heteroarylene, said 
group optionally being substituted with 1 to 4 group R 11 , 

F is, when present, a valence bond, O, S, or NR 4 , 
A is a spacing group distancing the chemical structure from the com- 
plementing element, which may be a nucleic add, 
25 Rl » R * and r3 are independent of each other selected among the 

group consisting of H, d-d alky!, d-d alkenyl, d-d alkynyl, d-d alkadienyl, 
d-d cycloalkyl, d-d cycloheteroalkyl, aryl, and heteroaryl, said group being sub- 
stituted with 0-3 R 4 , 0-3 R 5 and 0-3 R 9 or d-d alkylene-NR 4 2 , d-d al- 
kylene-NR 4 C(0)R a , d-d alkylene-NR 4 C(0)OR a , d-d alkylene-0-NR 4 2l d-d al- 
30 W*ne-0-NR 4 C<0)R^ 

FEP is a group selected among the group consisting of H, d-d alkyl, 
.CrC alkenyl, d-d alkynyl, d-d alkadienyl, d-d cydoalkyl, d-d cydoheteroal- 
kyl, aryl. and heteroaryl, said group being substituted with 0-3 R 4 , 0-3 R 5 and 0-3 R 9 
or C r d alkylene-NR 4 2 , d-d alkylene-NR 4 C(0)R a , d-d alkylene-NR 4 C(0)OR a , 
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d-C 2 alkylene-0-NR 4 2l d-d alkylene-0-NR 4 C(0)R fl , d-d al- 
kylene-0-NR 4 C(0)OR 8 substituted with 0-3 R 9 , 

where R 4 is H or selected independently among the group consisting of 
d-C 6 alkyl, C 2 -C 6 alkenyl, d-C a alkynyl, C r C 7 cycloalkyl, Qrd cycloheteroalkyl, 
5 aryl, heteroaryl, said group being substituted with 0-3 R 9 and 

R 5 is selected independently from -N 3l -CNO, -C(NOH)NH 2 , -NHOH, 
.NHNHR 6 , -C(0)R 6 . -SnR 8 3 , -B(OR 6 ) 2 , -P(0)(OR 6 ) 2 or the group consisting of d-d 
alkenyl, d-C 6 alkynyl, C 4 -C 6 alkadienyl said group being substituted with 0-2 R 7 . 

where R 8 is selected independently from H, d-d alkyl. d_C 7 cycloal- 
1 0 kyi, aryl or d-C 6 alkylene-aryl substituted with 0-5 halogen atoms selected from -F, - 
-CI, -Br, and -4; and R 7 is independently selected from -N0 2 , -COOR 8 , -COR 8 , -CN, 
-OSiR fi 3l -OR 8 and-NR 6 2 . 

R a is H, C,-d alkyl, d-d alkenyl, d-d alkynyl. d-d cycloalkyl, aryl or C r C 6 
alkylene-aryl substituted with 0-3 substituents independently selected from -F, -CI, - 
5 N0 2 , -R 3 , -OR a , -SiR 3 3 

R 9 is =0, -F, -CI, -Br, -I, -CN, -NO* -OR 8 . -NR 6 2 , -NR 8 -C(0)R 8 , -NR 8 -C(0)OR fl , -SR 8 , 
-S(0)R 8 , -S(0) 2 R 6 , -COOR 8 , -C(0)NR 6 2 and -S(0) 2 NR 6 2 . 

In a preferred embodiment Z is O or S, P is a valence bond, Q is CH, B is CH 2 , and 
R\ R 2 , and R 3 is H. The bond between the carbonyl group and Z is cleavabie with 
aqueous l 2 . 

Contacting between target and library 

The contacting step, by which the library of bifunctionai molecules is subjected un- 
der binding conditions to a target, may be referred to as the enrichment step or the 
selection step, as appropriate, and includes the screening of the library for synthetic 
molecules having predetermined desirable characteristics. Predetermined desirable 
characteristics can include binding to a target, catalytically changing the target, 
chemically reacting with a target in a manner which alters/modifies the target or the 
functional activity of the target, and covalently attaching to the target as in a suicide 
inhibitor. 

In theory, molecules of interest can be selected based on their properties using ei- 
ther physical or physiological procedures. The method preferred according to the 
present invention is to enrich molecules with respect to binding affinity towards a 
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target of interest In a certain embodiment, the basic steps involve mixing the library 
of complexes with the immobilized target of interest The target can be attached to a 
column matrix or micratitre wells with direct immobilization or by means of antibody 
binding or other high-affinity interactions: In another embodiment, the target and 
displayed molecules interact without immobilisation of the target Displayed mole- 
cules that bind to the target will be retained on this surface, while nonbinding dis- 
played molecules will be removed during a single or a series of wash steps. The 
identifiers of complexes bound to the target can then be separated by cleaving the 
physical connection to the synthetic molecule. It may be considered advantageously 
to perform a chromatography step after of instead of the washing step. After the 
cleavage of the physical link between the synthetic molecule and the identifier, the 
identifier may be recovered from the media and optionally amplified before the de- 
coding step. 

A significant reduction in background binders may be obtained with increased 
washing volumes, repeating washing steps, higher detergent concentrations and 
prolonged incubation during washing. Thus, the more volume and number of steps 
used in the washing procedure together with more stringent conditions will more ' 
efficiently remove non-binders and background binders. The right stringency in the 
washing step can also be used to remove low-affinity specific binders. However, the 
washing step will also remove wanted binders if too harsh conditions are used. 

A blocking step, such as incubation of solid phase with skimmed milk proteins or 
other inert proteins and/or mild detergent such as Tween-20 and Triton X-100, may 
. also be used to reduce the background. The washing conditions should be as strin- 
gent as possible to remove background binding but to retain specific binders that 
interact with the immobilized target Generally, washing conditions are adjusted to 
maintain the desired affinity binders, e.g. binders in the micromolar, nanomolar, or 
pocomolar range. 

In traditional elution protocols, false positives due to suboptimal binding and wash- 
ing conditions are difficult to circumvent and may require elaborate adjustments of 
experimental conditions. However, an enrichment of more than 100 to 1000 is rarely 
obtained. The present invention alleviates the problem with false positive being ob- 
tained because the non-specific binding complexes to a large extent remain in the 
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reaction chamber. The experiments reported herein suggest that an enrichment of 
more than 1 0 7 can be obtained. 

The target can be any compound of interest E.g. the target.can be aprotein, pep- 
tide/carbohydrate, polysaccharide, glycoprotein, hormone, receptor, antigen, anti- 
body, virus, substrate, metabolite, transition state analogue, cofactor, inhibitor, drug, 
dye, nutrient, growth factor, cell, tissue, etc. without limitation. Suitable targets in- 
clude, but are not limited to, angiotensin converting enzyme, renin, cyclooxygenase, 
5-lipoxygenase, JIL- 1 0 converting enzyme, cytokine receptors, PDGF receptor, 
* type II iriosine monophosphate dehydrogenases-lactamases.' integrin, proteases 
like factor Vila, kinases like Bcr-Abl/Her, phosphotases like PTP-1 B, and fungal cy- 
tochrome P-450. Targets can include, but are not limited to, bradykinin, neutrophil 
elastase. the HIV proteins, including tat, rev, gag, int. RT, nucleocapsid etc., VEGF, 
bFGF, TGFp, KGF, PDGF, GPCR thrombin, substance P, IgE, sPLA2, red blood 
cells, glioblastomas, fibrin clots, PBMCs, hCG, lectins, selectins, cytokines, ICP4, 
complement proteins, etc. 

A target can also be a surface of a non-biological origin, such as a polymer surface 
or a metal surface. The method of the invention may then be used to identify suit- 
able coatings for such surfaces. 

In a preferred embodiment, the desirable synthetic molecule acts on the target with- 
out any interaction between the nucleic acid attached to the desirable encoded 
molecule and the target In one embodiment, the bound complex-target aggregate 
can be partitioned from unbound complexes by a number of methods. The methods 
include nitrocellulose filter binding, column chromatography, filtration, affinity chro- 
matography, centrifugation, and other well known methods. A preferred method is 
size-exclusion chromatography. 

Briefly, the library of complexes is subjected to the target, which may include contact 
between the library and a column onto which the target is immobilised. Identifiers 
associated with undesirable synthetic molecules, i.e. synthetic molecules not bound 
to the target under the stringency conditions used, will pass through the column. 
Additional undesirable synthetic molecules {e.g. synthetic molecules which cross- 
react with other targets) may be removed by counter-selection methods. Desirable 
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complexes are bound to the column. The target may be immobilized in a number of 
ways. In one embodiment, the target is immobilized through a cleavable physical 
link, such as one more chemical bonds. The aggregate of the target and the com- 
plex may then be subjected to a size exclusion chromatography to separate the ag- 
gregate from the rest of the compounds in the media. The complex may then be 
eluted from the target by changing the conditions (e.g., salt, pH, surfactant, tem- 
perature etc.). Alternatively, the complex may be provided with a cleavable linker, 
preferable orthogonal to the cleavable linker that attached the target to the solid 
support, at a position between the synthetic molecule and the identifier. Subsequent 
to the size exclusion chromatography this cleavable linker is cleaved to separate the 
identifiers of complexes having affinity towards the targets. Just to mention a single 
type of orthogonal cleavable linkages, one could attached to target to the solid sup- 
port through a linkage that can be cleaved by a chemical agent, and the linker sepa- 
rating the synthetic molecule and the identifier may be selected as a photocleavable 
linkage. More specifically, the former linkage may be a disulphide bond that can be 
cleaved by a suitable reducing agent like DTT (dithiothreitol) and the latter linkage 
may be a o-nitrophenyl group. 

There are other partitioning and screening processes which are compatible with this 
invention that are known to one of ordinary skill in the art. In one embodiment, the 
products can be fractionated by a number of common methods and then each frac- 
tion is then assayed for activity. The fractionization methods can include size, pH, 
hydrophobicity, etc 

Inherent in the present method is the selection of encoded molecules on the basis of 
a desired function; this can be extended to the selection of molecules with a desired 
function and specificity. Specificity can be required during the selection process by 
first extracting complexes which are capable of interacting with a non-desired "tar- 
get- (negative selection, or counter-selection), followed by positive selection with the 
desired target As an example, inhibitors of fungal cytochrome P-450 are known to 
cross-react to some extent with mammalian cytochrome P-450 (resulting in serious 
side effects). Highly specific inhibitors of the fungal cytochrome could be selected 
from a library by first removing those complexes capable of interacting with the 
mammalian cytochrome, followed by retention of the remaining products which are 
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capable of interacting with the fungal cytochrome. . 
Cleavable (inkers 

A deavable linker may be positioned between the target and a solid support or be- 
5 tween the potential drug candidate and the identifier or any other position that may 
ensure a separation of the identifier from successful complexes from non-specific 
binding complexes. The cleavable linker may be selectively cleavable, i.e. conditions 
may selected that only cleave that particular linker. 

0 The cleavable linkers may be selected from a large plethora of chemical structures. 
Examples of linkers includes, but are not limited to, linkers having an enzymatic 
cleavage site, linkers comprising a chemical degradable component, linkers 
cleavable by electromagnetic radiation. 



5 Examples of linkers cleavable by electromagnetic radiation 
o-nitrobenzyl p-alkoxy 



R 3 

hv 




X- 



t 



NO2 R 2 hv 



o-nitrobenzyl in exo position 
NO2 R 2 

For more details see Holmes CP. J. Org. Chem. 1997, 62, 2370-2380 
3-nitrophenyloxy 

/ =B 'j OH OH 
For more details see Rajasekharan.Pillai, V. N. Synthesis. 1980, 1-26 
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Dansyl derivatives: 

o=s=o 




For more details see Rajasekharan Pillai, V. N. Synthesis. 1980, 1-26 

■ i 

Coumarin derivatives 
„NR 2 R 3 



hv , H— NR 2 R 3 




H-Donor 



For more details see R. O. Schoenleber, B. Giese. Synlett 2003, 501-504 

R 1 and R 2 can be either of the potential drug candidate and the identifier, respec- 
tively. Alternatively, R 1 and R 2 can be either of the target or a solid support, respec- 
tively. 

R 3 = HorOCH 3 

If X is O then the product will be a carboxylic acid 
If X is NH the product will be a carboxamide 

One specific example is the PC Spacer Phosphoramidite (Glen research catalog # 
10-4913-90) which can be introduced in an oligonucleotide during synthesis and 
cleaved by subjecting the sample in water to UV light (~ 300-350 nm) for 30 sec- 
onds to 1 minute. 
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O /-°~P-N(iPr) 2 



DMT = 4,4-Dimethoxytntyl 
. iPr = Isopropyl 
CNEt = Cyanoethyl 



The above PC spacer phosphoamidite is suitable incorporated in a library of com- 
plexes at a position between the indentifier and the potential drug candidate. The 
spacer may be cleaved according to the following reaction. 

R 1 and R 2 can be either of the encoded molecule and the identifying molecule, re- 
spectively. In a preferred aspect R 2 is an oligonucleotide identifier and the R 1 is the 
potential drug candidate. When the linker is cleaved a phosphate group is generated 
allowing for further biological reactions. As an example, the phosphate group may 
be positioned in the 5 end of an oligonucleotide allowing for an enzymatic ligation 
process to take place. 

Examples of linkers cfeavable by chemical agents: 



Ester linkers can be cleaved by nucleophilic attack using e.g. hydroxide ions. In 
practice this can be accomplished by subjecting the target-ligand complex to a base 
for a short period. 



WO 2005/003778 



R 3 R 4 




33 



OH- O 

— - *Jk 



OH 



PCT/DK2004/000466 



f? R 4 




R 5 R 6 



10 



20 



R 1 and R 2 can be the either of be the potential drug candidate or the identifier, re- 
spectively. R 4 * can be any of the following: H, CN, F, N0 2 , S0 2 NR 2 . 

Disulfide linkers can efficiently be cleaved / reduced by Tris (2-carboxyethyl) phos- 
phine (TCEP). TCEP selectively and completely reduces even the most stable wa- 
ter-soluble alky! disulfides over a wide pH range. These reductions frequently re- 
quired less than 5 minutes at room temperature. TCEP is a non-volatile and odor- 
less reductant and unlike most other reducing agents, it is resistant to air oxidation. 
Trialkylphosphines such as TCEP are stable in aqueous solution, selectively reduce 
disulfide bonds, and are essentially unreactive toward other functional groups com- 
monly found in proteins. 



15 



R^S-S-R 2 



j Q^OH 



H*0 



R^SH * HS-R* 



More details on the reduction of disulfide bonds can be found in Kirley, T.L.(1989), 
Reduction and fluorescent labeling of cyst(e)ine-containing proteins for subsequent 
structural analysis, Anal. Biochem. 180, 231 and Levison, M.E., etai. (1969), Re- 
duction of biological substances by water-soluble phosphines: Gamma-globulin. 
Experentia 25, 126-127. 



25 



Linkers cleavable by enzymes 

The linker connecting the potential drug candidate with the identifier or the solid 
support and the target can include a peptide region that allows a specific cleavage 
using a protease. This is a well-known strategy in molecular biology. Site-specific 
• proteases and their cognate target amino acid sequences are often used to remove 
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the fusion protein tags that facilitate enhanced expression, solubility, secretion or • 
purification of the fusion protein. 

Various proteases can be used to accomplish a specific cleavage. The specificity is 
especially important when the cleavage site is presented together with other se- 
5 quences such as for example the fusion proteins. Various conditions have been op- 
timized in order to enhance the cleavage efficiency and control the specificity. These 
conditions are available and know in the art. 

Enterokinase is one example of an enzyme (serine protease) that cut a specific 
10 amino acid sequence. Enterokinase recognition site is Asp-Asp-Asp-Asp-Lys 

(DDDDK), and it cleaves C-terminally of Lys. Purified recombinant Enterokinase is 
commercially available and is highly active over wide ranges in pH (pH 4.5-9.5) and 
temperature (4-45°C). 

15 The nuclear inclusion protease from tobacco etch virus (TEV) is another commer- 
cially available and well-characterized proteases that can be used to cut at a specific 
amino acid sequence. TEV protease cleaves the sequence Glu-Asn-Leu-Tyr-Phe- 
Gln-Gly/Ser (ENLYFQG/S) between Gln-Gly or Gln-Ser with high specificity. 

20 Another well-known protease is thrombin that specifically cleaves the sequence Leu- 
Val-Pro-Arg-Gly-Ser (LVPAGS) between Arg-Gly. Thrombin has also been used for 
cleavage of recombinant fusion proteins. Other sequences can also be used for 
thrombin cleavage; these sequences are more or less specific and more or less effi- 
ciently cleaved by thrombin. Thrombin is a highly active protease and various reac- 

25 tion conditions are known to the public. 

Activated coagulation factor FX (FXa) is also known to be a specific and useful pro- 
tease. This enzyme cleaves C-terminal of Arg at the sequence lle-Glu-Gly-Arg 
(IEGR). FXa is frequently used to cut between fusion proteins when producing pro- 
30 teins with recombinant technology. Other recognition sequences can also be used 
for FXa. 

Other types of proteolytic enzymes can also be used that recognize specific amino 
acid sequences. In addition, proteolytic enzymes that cleave amino acid sequences 
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in an un-specific manner can also be used if only the linker contains ah amino add 
sequence in the complex molecule. 

Other type of molecules such as ribozymes. catalytically active antibodies, or lipases 
can also be used. Tne only prerequisite is that the catalytically active molecule can 
cleave the specific structure used as the linker, or as a part of the linker, that con- 
nects the encoding region and the displayed molecule or, in the alternative the solid 
support and the target 

A variety of endonucleases are available that recognize and cleave a double 
stranded nucleic acid having a specific sequence of nucleotides. The endonuclease 
Eco Rl is an example of a nuclease that efficiently cuts a nucleotide sequence linker 
comprising the sequence GAATTC also when this sequence is close to the nucleo- 
tide sequence length. Purified recombinant Eco Rl is commercially available and is 
highly active in a range of buffer conditions. As an example the Eco Rl is working in 
in various protocols as indicted below (NEBufferis available from New England 
Biolabs): 

NEBuffer 1 : [10 mM Bis Tris Propane-HCI, 10 mM MgCI 2 . 1 mM dithiothreitol (pH 
7.0 at 25°C)], 

NEBuff*2:[50mMNaCU0mMT^ 
7.9 at 25°C)], 

NEBuffer 3 : [100 mM NaCI, 50 mM Tris-HCI, 10 mM MgCI 2 , 1 mM dithiothreitol (pH 
7.9 at 25 S C)J, " 

NEBuffer 4 : [50 mM potassium acetate. 20 mM Tris-acetate, 1 0 mM magnesium 
acetate, 1 mM dithiothreitol (pH 7.9 at 25°C)]. 

Extension buffer : mM KCI, 20 mM Tris-HCI(Ph 8.8 at 25° C), 10 mM (NH 4 ) 2 S0 4 , 2 
mM MgSO 4 and 0.1% Triton X-100, and 200 pM dNTPs. 



Determining the identifier sequence 

The nucleotide sequence of the identifier sequence present in the isolated Afunc- 
tional molecules or the separated identifiers is determined to identify the chemical 
entities that participated in the binding interaction. The synthesis method of the 
synthetic molecule may be established if information on the chemical entities as well 
as the point in time they have been incorporated in the synthetic molecule can be 
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deduced from the identifier: It may be sufficient to get information on the chemical 
structure of the various chemical entities that have participated in the synthetic 
molecule to deduce the full molecule due to structural constraints during the forma- 
tion. As an example, the use of different kinds of attachment chemistries may en- 
sure that a chemical entity on a building block can only be transferred to a single 
position on a scaffold. Another kind of chemical constrains may be present due to 
steric hindrance on the scaffold molecule or the chemical entity to be transferred. In 
general however, it is preferred that information can be inferred from the identifier 
sequence that enable the identification of each of the chemical entities that have 
10 ' participated in the formation of the encoded molecule along with the point in time in 
the synthesis history the chemical entities have been incorporated in the (nascent) 
synthetic molecule.' 

Although conventional DNA sequencing methods are readily available and useful for 
this determination, the amount and quality of isolated Afunctional molecule may 
require additional manipulations prior to a sequencing reaction. 

Where the amount is low, it is preferred to increase the amount of the identifier se- 
quence by polymerase chain reaction (PCR) using PCR primers directed primer 
binding sites present in the identifier sequence. 

In addition, the quality of the isolated Afunctional molecule may be such that multi- 
ple species of Afunctional molecule are co-isolated by virtue of similar capacities for 
binding to the target In cases where more than one species of Afunctional molecule 
are isolated, the different isolated species must be separated prior to sequencing of 
the identifier oligonucleotide. 

Thus in one embodiment, the different identifier sequences of the isolated Afunc- 
tional complexes are cloned into separate sequencing vectors prior to determining 
their sequence by DNA sequencing methods. This is typically accomplished by am- 
plifying all of the different identifier sequences by PCR as described herein, and 
then using a unique restriction endonuclease sites on the amplified product to direc- ' 
tionally clone the amplified fragments into sequencing vectors. The cloning and se- 
quencing of the amplified fragments then is a routine procedure that can be carried 
out by any of a number of molecular biological methods known in the art 
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Alternatively, the bifunctonal complex or the PCR amplified identifier sequence can 
be analysed in a microarray. The array may be designed to analyse the presence of 
a single codon or multiple codons in a identifier sequence. 

Synthesis of nucleic adds 

Oligonucleotides can be synthesized by a variety of chemistries as is well known. 
For synthesis of an oligonucleotide on a substrate in the direction of 3* to 5', a free " 
hydroxy terminus is required that can be conveniently blocked and deblocked as 
needed. A preferred hydroxy terminus blocking group is a dimexothytrityl ether 
(DMT). DMT blocked termini are first deblocked, such as by treatment with 3% di- 
chloroacetic add in dichloromethane (DCM) as is well known for oligonucleotide 
synthesis, to form a free hydroxy terminus. 

Nucleotides in precursor form for addition to a free hydroxy terminus in the direction 
of 3* to 5' require a phosphoramidate moiety having an aminodiisopropyl side chain 
at the 3' terminus of a nucleotide. In addition, the free hydroxy of the phosphorami- 
date is blocked with a cyanoethyl ester (OCNET), and the 5' terminus is blocked 
with a DMT ether. The addition of a 5' DMT-, 3' OCNET-blocked phosphoramidate 
.nucleotide to a free hydroxyl requires tetrazole in acetonitrile followed by iodine oxi- 
dation and capping of unreacted hydroxyls with acetic anhydride, as is well known 
for oligonucleotide synthesis. The resulting product contains an added nucleotide 
residue with a DMT blocked 5' terminus, ready for deblocking and addition of a sub- 
sequent blocked nucleotide as before. 

For synthesis of an oligonucleotide in the direction of 5' to 3\ a free hydroxy termi- 
nus on the linker is required as before. However, the blocked nucleotide to be added 
has the blocking chemistries reversed on its 5' and 3' termini to facilitate addition in 
the opposite orientation. A nucleotide with a free 3' hydroxyl and 5' DMT ether is first ■' 
blocked at the 3' hydroxy terminus by reaction with TBS-CI in imidazole to form a 
TBS ester at the 3' terminus. Then the DMT-blocked 5* terminus is deblocked with. 
DCA in DCM as before to form a free 5' hydroxy terminus. The reagent (N.N- 
diisopropylamino)(cyanoethyl) phosphonamidic chloride having an aminodiisopropyl 
group and an OCNET ester is reacted in tetrahydrofuran (THF) with the 5' de- 
blocked nucleotide to form the aminodiisopropyl-, OCNET-blocked phosphonami- 
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date group on the 5* terminus. Thereafter the 3* TBS ester is removed with tetrabu- 
tylammonium fluoride (TBAF) in DCM to form a nucleotide with the phosphonami- 
date-blocked 5' terminus and a free 3' hydroxy terminus. Reaction in base with 
DMT-CI adds a DMT ether blocking group to the 3' hydroxy terminus. 

The addition of the 3' DMT-, S OCNET-blocked phosphonamidated nucleotide to a 
linker substrate having a free hydroxy terminus then proceeds using the previous 
tetrazole reaction, as is well known for oligonucleotide polymerization. The resulting 
product contains an added nucleotide residue with a DMT-blocked 3* terminus, 
ready for deblocking with DCA in DCM and the addition of a subsequent blocked 
nucleotide as before. 

Extension and amplification 

The use of the polymerase chain reaction (PCR) is a preferred embodiment, for the 
production of the identifiers using the nucleic acids of the selected complexes as 
identifiers. 

For use in this invention, the identifier sequences are preferably comprised of poly- 
nucleotide coding strands, such as mRNA and/or the sense strand of genomic DNA 
or non-natural nucleic acids, like TNA and LNA which may be used as template for a 
polymerase. If the genetic material to be processed is in the form of double stranded 
nucleic acid, it is usually first denatured, typically by melting, into single strands. The 
nucleic acid is subjected to a PCR reaction by treating (contacting) the sample with 
a PCR primer pair, each member of the pair having a preselected nucleotide se- 
quence. The PCR primer pair is capable of initiating primer extension reactions by 
hybridizing to the PCR primer binding site on identifier oligonucleotide, preferably at 
least about 10 nucleotides in length, more preferably at least about 12 nucleotides in 
length. The first primer of a PCR primer pair is sometimes referred to as the "anti- 
sense primer because it is extended into a non-coding or anti-sense strand of a 
nucleic add, i.e., a strand complementary to a coding strand. The second primer of 
a PCR primer pair is sometimes referred to as the "sense primer" because it is ad- 
joined with the coding or sense strand of a nucleic acid. 



The PCR reaction is performed by mixing the PCR primer pair, preferably a prede- 
termined amount thereof, with the nucleic acids of the sample, preferably a prede- 
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termined amount thereof, in a PCR buffer to form a PCR reaction admixture. The 
admixture is thermocycled for a number of cycles, which is typically predetermined, 
sufficient for the formation of a PCR reaction product, thereby amplifiyng the identifi- 
ers in the isolated complex. 

PCR is typically carried out by thermocycling i.e., repeatedly increasing and de- 
creasing the temperature of a PCR reaction admixture within a temperature range 
whose lower limit is about 30 degrees Celsius (30° C.) to about 55° C. and whose 
upper limit is about 90° C. to about 100° C. The increasing and decreasing can be 
continuous, but is preferably phasic with time periods of relative temperature stability 
at each of temperatures favoring polynucleotide synthesis, denaturation and hybridi- 
zation. 

A plurality of first primer and/or a plurality of second primers can be used in each 
amplification, e.g., one species of first primer can be paired with a number of differ- 
ent second primers to form several different primer pairs. Alternatively, an individual 
pair of first and second primers can be used. In any case, the amplification products 
of amplifications using the same or different combinations of first and second prim- 
ers can be combined for assaying for mutations. 

The PCR reaction is performed using any suitable method. Generally it occurs in a 
buffered aqueous solution, i.e., a PCR buffer, preferably at a pH of 7-9, most pref- 
erably about 8. Preferably, a molar excess of the primer is admixed to the buffer 
containing the identifier strand. A large molar excess is preferred to improve the 
efficiency of the process. 

The PCR buffer also contains the deoxyribonucieotide triphosphates (polynucleotide 
synthesis substrates) dATP, dCTP, dGTP, and dTTP and a polymerase, typically 
thermostable, ail in adequate amounts for primer extension (polynucleotide synthe- 
sis) reaction. The resulting solution (PCR admixture) is heated to about 90" C.-100 0 
C for about 1 to 1 0 minutes, preferably from 1 to 4 minutes. After this heating period 
the solution is allowed to cool to a primer hybridization temperature. The synthesis 
reaction may occur at from room temperature up to a temperature above which the 
polymerase (inducing agent) no longer functions efficiently. Thus, for example, if 
DNA polymerase is used as inducing agent, the temperature is generally no greater 
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than about 40* C. The thermocycling is repeated until the desired amount of PGR 
product is produced. An exemplary PCR buffer comprises the following: 50 mM KCI; 
10 mM tris-HCI at pH 8.3; 1.5 mM MgCI2 ; 0.001% (wt/vol) gelatin, 200 uM dATP; 
200 pM dTTP; 200 pM dCTP; 200 pM dGTP; and 2.5 units Thermus aquaticus (Taq) 
5 DNA polymerase I (U.S. Pat. No. 4,889,818) per 100 microliters (pi) of buffer. 

The inducing agent may be any compound or system which will function to accom- 
plish the synthesis of primer extension products, including enzymes. Suitable en- 
zymes for this purpose include, for example, E. coli DNA polymerase I, Klenow 
) fragment of E. coli DNA polymerase I, T4 DNA polymerase, Taq DNA polymerase, 
Pfu polymerase, Vent polymerase, HIV-1 Reverse Transcriptase, other available 
DNA polymerases, reverse transcriptase, and other enzymes, including heat-stable 
enzymes, which will facilitate combination of the nucleotides in the proper manner to 
form the primer extension products which are complementary to each nucleic acid 
strand. Generally, the synthesis will be initiated at the 3' end of each primer and 
proceed in the 5 1 direction along the identifier strand, until synthesis terminates, pro- 
ducing molecules of different lengths. There may be inducing agents, however, 
which initiate synthesis .at the 5* end and proceed in the above direction, using the 
same process as described above. 

The inducing agent also may be a compound or system which will function to ac- 
complish the synthesis of RNA primer extension products, including enzymes. In 
preferred embodiments, the inducing agent may be a DNA-dependent RNA poly- 
merase such as T7 RNA polymerase, T3 RNA polymerase or SP6 RNA polymerase. 
These polymerases produce a complementary RNA polynucleotide. The high turn- 
over rate of the RNA polymerase amplifies the starting polynucleotide as has been 
described by Chamberlin et at., The Enzymes, ed. P. Boyer, pp. 87-108, Academic 
Press, New York (1982). Amplification systems based on transcription have been 
described by Gingeras et aL, in PCR Protocols, A Guide to Methods and Applica- 
tions, pp. 245-252, Innis et a!., eds, Academic Press, Inc., San Diego. Calif. (1990). 

If the inducing agent is a DNA-dependent RNA polymerase and, therefore incorpo- 
rates ribonucleotide triphosphates, sufficient amounts of ATP, CTP, GTP and UTP 
are admixed to the primer extension reaction admixture and the resulting solution is 
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treated as described above. 

The newly synthesized strand and its complementary nucleic add strand form a 
double-stranded molecule which can be used in the succeeding steps of the 
5 method. PCR amplification methods are described in detail in U.S. Pat Nos. 

4,683,192, 4,683,202, 4,800;159, and 4,965,188, and at least in several texts in- 
cluding PCR Technology: Principles and Applications for DNA Amplification, H. Er- 
lich, ed.. Stockton Press, New York (1989); and PCR Protocols: A Guide to Methods 
and Applications, Innis et al., eds., Academic Press, San Diego, Calif. (1990). The 
1 0 term "primer" as used herein refers to a polynucleotide whether purified from a nu- 
cleic add restriction digest or produced synthetically, which is capable of acting as a 
point of initiation of nudeic add synthesis when placed under conditions in which 
synthesis of a primer extension product which is complementary to a nudeic add 
strand is induced, i.e., in the presence of nudeotides and an agent for polymeriza- 
15 tion such as DNA polymerase, reverse transcriptase and the like, and at a suitable 
temperature and pH. The primer is preferably single stranded for maximum effi- 
ciency, but may alternatively be in double stranded form. If double stranded, the 
primer is first treated to separate it from its complementary strand before being used 
to prepare extension products. Preferably, the primer is a polydeoxyribonudeotide. 
50 The primer must be sufficiently long to prime the synthesis of extension products in 
the presence of the agents for polymerization. The exact lengths of the primers will 
depend on many factors, including temperature and the source of primer. For exam- 
ple, depending on the complexity of the target sequence, a polynucleotide primer 
typically contains 1 0 to 25 or more nucleotides, although it can contain fewer nu- 
5 cleotides. Short primer molecules generally require cooler temperatures to form suf- 
ficiently stable hybrid complexes with identifier. 

The primers used herein are selected to be "substantially" complementary to the 
different strands of each specific sequence to be synthesized or amplified. This 
means that the primer must be sufficiently complementary to non-randomly hybrid- 
ize with its respective identifier strand. Therefore, the primer sequence may or may 
not reflect the exact sequence of the identifier. For example, a non-complementary 
nudeic add can be attached to the 5' end of the primer, with the remainder of the 
primer sequence being substantially complementary to the strand. Such non- 
complementary fragments typically code for an endonuclease restriction site or used 
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Primers of the present invention may also contain a DNA-dependent RNA polymer- 
ase promoter sequence or its complement See for example, Krieg et al., Nucl. A- 
5 cids Res., 12:7057-70 (1984); Studier et al. t J. Mol. Biol., 189:113-130 (1986); and 
Molecular Cloning: A Laboratory Manual, Second Edition, Maniatis et al., eds., Cold 
Spring Harbor, N.Y. (1989). 

When a primer containing a DNA-dependent RNA polymerase promoter is used, the 
1 0 primer is hybridized to the polynucleotide strand to be amplified and the second 
polynucleotide strand of the DNA-dependent RNA polymerase promoter is com- 
pleted using an inducing agent such as E. coli DNA polymerase I, or the Klenow 
fragment of E. coli DNA polymerase. The starting polynucleotide is amplified by al- 
ternating between the production of an RNA polynucleotide and DNA polynudeo- 
1 5 tide. This may be used for selective degradation of the RNA strand, which is prone 
to disintegration upon treatment with a strong base. 

Primers may also contain a identifier sequence or replication initiation site for a 
RNA-directed RNA polymerase. Typical RNA-directed RNA polymerase include the ' 
QB replicase described by Lizardi eta!., Biotechnology. 6:1197-1202 (1988). RNA- 
directed polymerases produce large numbers of RNA strands from a small number 
of identifier RNA strands that contain a identifier sequence or replication initiation 
site. These polymerases typically give a one million-fold amplification of the identifier 
strand as has been described by Kramer et al., J. Mol. Biol., 89:719-736 (1974). 

In one embodiment, the present invention utilizes a set of polynucleotides that form 
primers having a priming region located at the 3-terminus of the primer. The 3- 
terminal priming portion of each primer is capable of acting as a primer to catalyze 
nucleic acid synthesis, Le., initiate a primer extension reaction off its 3' terminus. 
One or both of the primers can additionally contain a S'-terminal non-priming portion, 
i.e., a region that does not participate in hybridization to the preferred identifier The 
5 -part of the primer may be labelled as described herein above. 

Brief Description of the Figures 

Fig. 1 a and 1 b shows a picture of a gel resulting from example 1 
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Fig. 2 discloses a picture of a gel resulting from example 2. 
Fig. 3 depicts a picture of gel which shows the results of example 3. 
Rg. 4 discloses a schematic representation of two embodiments of the present in- 
vention. 

Detailed Description of the drawings 

Fig. 4 discloses details of the present invention. Initially, a library of complexes 
(frame 1) is provided. The potential drug candidate is represented by a circle and 
the identifier is indicated by a rectangle. The straight line between the identifier and 
10 the potential drug candidate illustrate a linkage which in some embodiments is 
cleavable and in other embodiments are durable under the conditions used. The 
potential drug candidate and the identifier are filled with a suitable pattern to illus- 
trate that the individual complexes are different from one another. 

15 In process step A. the library is subjected to a target in solution (frame 2). The part 
of the library which has affinity towards the target will bind to the target The symbol 
P indicate a selectively cleavable linker, e.g. a PC spacer, cleavable by exposure to 
light of a certain wave length. Following process step I, the target may be immobi- 
lized to a solid support indicated by the container wall utilizing a linkage a (frame 3). 
The linkage a may be selectively cleavable or a durable linker. Preferably, in proc- 
ess step C, The non-binding part of the library is removed and the aggregate of the 
complex and the target is recovered (frame 4). The recovery of the aggregate of 
target and complex when in solution may be performed by size-exclusion chroma- 
tography. 

After the recovery of the aggregate of complex and target, the target may be immo- 
bilized following process path I (frame 5). Preferably, using process step E, the link- 
age p is cleaved thus separating the potential drug candidate from the identifier 
(frame 6). After the separation, the identifier is isolated to decode the identity of the 
potential drug candidate. Prior to the isolation, the target bound to the potential drug 
candidate may be immobilized to facilitate the isolation. Optionally, the isolated 
identifier (frame 8) is amplified by e.g. PCR prior to the identification step. 

Following the route starting with process step B, the library of complexes is sub- 
jected to an immobilized target Complexes having affinity towards the target will 
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bind thereto (frame 3). In effect, the complex is immobilized. The non-binding com- 
plexes are washed away in process step D and the immobilized complex-target- 
aggregate is isolated (frame 5). In the event the linkage a is selectively cleavable, 
the process step I may be followed to form the isolated aggregate in frame 4. The 
5 identifier may be cleaved off and the decoded to identify the identity of the potential 
drug candidate. When the linkages a and p are both selectively cleavable, they are 
preferable orthogonal cleavable, i.e. the conditions which cleave the linkage a does 
not cleave linkage p and visa versa. 

10 In the event the linker a is durable, i.e. not cleaved under the conditions used, the 
process step F is followed. The immobilized target is maintained connected to the 
solid support while the identifier is cleaved to the potential drug candidate (frame 7). 
The identifier is subsequently isolated using process step H (frame 8). Prior to de- 
coding the identifier may be amplified by suitable process to obtain an appropriate 

15 amount of genetic material. 

Examples 

Example 1: E nrichment of biotin tagged DNA identifier. 

20 1 00 pico mol of DNA identifier T1 of the sequence (GAGCGGATGCGTACATCTTGTA- 
CATGTCAATG CG ATCG ATCG ACTG ATCAAGATAGACC GAGTGC CG) was annealed to 
50 pico mol of a biotin tagged primer F1-PC-B (Biotin-PC- 
CGGCACTCGGTCTATCTT). PC is a photo cleavable group and is obtainable as a 
phosphoramidite from Glen Research (Products cat# 10-4913). Subsequently the 

25 F1-PC-B primer was extended by DNA polymerase (Sequenase from Upstate Bio- 
technology Cat# 70775Y). Likewise 50 pico mol of a DNA identifier T2 

(GAGCGGATGCGTACATCTACGATGGATGCTCCAGGTCGCAAGATAGAC- 
CGAGTGCCG) was annealed to 1 00 pico mol of a primer F1 
(CGGCACTCGGTCTATCTT) and extended by DNA polymerase. This generated 
30 two pools of double stranded DNA sequences each containing 50 pico mol of the 
following complexes, one of length 67 nucleotides (termed T1+B) and one of length 
57 nucleotides termed (T2-B), 
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Enrichment of the biotin tagged T1+B complex was performed in Reacti-Bind™ 
streptavidin coated clear strip' plates (Pierce Biotechnology, Cat # 15120). The two 
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pools of double stranded DNA, T1+B and T2-B where mixed in ratios 1 pmol of 
T1+B complex and 1 pmol of T2-B complex, 1/10 pmol of TUB complex and 1 pmol 
of T2-B complex, 1/1 00 pmol of T1 +B complex and 1 pmol of T2-B complex. 1/1 .000 
pmol of T1+B complex and 1 pmolofT2-B complex. 1/10.000 pmol ofT1+Bcom- 
5 plex and 1 pmol of T2-B complex. 1/1 00.000 pmoi of T1 +B complex and 1 pmol of 
T2-B complex and 1/1.000.000 pmol of T1+B complex and 1 pmol of T2-B complex. 
All where mixed in 1 00 ul buffer A (Tris buffered saline. 0.05% Tween 20, 1 % Bo- 
vine serum albumin, 0.1 mg/mL herring sperm DNA). Incubation in streptavidin 
coated wells was done for 30 min at 25*C. After ligand binding wells were washed 
1 0 30 times with 250 u.L buffer A during one hour. Thereafter 1 00 uL buffer A was ap- 
plied to each well apd the wells where exposed to UV light at 350 nano meters for 
30 seconds in order to cleave the PC spacer thereby releasing the T1 identifier from 
the biotin molecule. Following exposure to UV light the elution volume was removed 
immediately and analyzed for the presence of DNA strands T1 and T2 by polymer- 
15 ase chain reaction (PCR). 

The content of T1 and T2 following enrichment for biotin binding molecules was 
analysed by generating M P phosphate labeled PCR products of T1 and T2. Both T1 
and T2 PCR products where generated by the use of the two primers F1 

20 (CGGCACTCGGTCTATCTT) and R1 (GAGCGGATGCGTACATCT). R1 was la- 
beled with ^P phosphate by the use of T4 polynucleotide kinase from Promega 
(Cat# M4101). PCR was performed by adding 10 uX of the eluted volume together 
with 1 5 pi. of water containing 1 pico mole of both F1 and ^P-labeled R1 to puRe- 
Taq™ Ready-To-Go™ PCR Beads (Amersham Biosciences, Gat# 40751 3-96). PCR 

25 was performed for 30 cycles by annealing at 58*C and extending at 72"C. PCR gen- 
erated products where separated by urea containing polyacrylamide gel electropho- 
resis and visualized by radiography. 

Eight wells precoated with streptavidin were used for the enrichment experiment 
30 shown in Fig. 1A. Lane 1 in Fig 1 A. represents PCR amplification on eiuate from a 

well that was incubated with Buffer A without any identifier complexes. Lane 2 was 

incubated with Buffer A containing 1 pmol of T1 +B complex and 1 pmol of T2-B 
. complex. Lane 3 was incubated with Buffer A containing 1/10 pmol of T1 +B complex 

and 1 pmol of T2-B complex. Lane 4 was incubated with Buffer A containing 1/1 00 
35. pmol of T1+B complex and 1 pmol of T2-B complex. Lane 5 was incubated with 
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Buffer A containing 1/1,000 pmol of T1+B complex and 1 pmol of T2-B complex 
• Lane 6 was incubated with Buffer A containing 1/1 0,000 pmol of T1 +B complex and 
1 pmol of T2-B complex. Lane 7 was incubated with Buffer A containing 1/100,000 
pmol of T1+B complex and 1 pmol of T2-B complex. Lane 8 was incubated with 
5 Buffer A containing 1/1 jOOO.OOO pmol of T1 +B complex and 1 pmol of T2-B complex. 

Lane 9 in Fig. 1A represents PCR amplification of the input into well 2. Lane 10 rep- 
resents PCR amplification of the input into well 3. Lane 1 1 represents PCR amplifi- 
cation of the input into well 4. Lane 12 represents PCR amplification of the input into 
1 0 well 5. Lane 1 3 represents PCR amplification of the input into well 6. Lane 14 repre- 
sents PCR amplification of the input into well 7. Lane 1 5 represents PCR amplifica- 
tion of the input into well 8. Lane 16 represents PCR amplification of water (Blank). 

In Fig. 1B the PCR products represented in Fig. 1A were subjected to additional 30 
1 5 cycles of PCR amplification. This revealed T1 and T2 identifiers of very low abun- 
dance. Lane 1 represents further PCR amplification of PCR product as represented 
in Fig. 1B lane 1, Lane 2 represents further PCR amplification of PCR product as 
represented in Fig. 1 B lane 2. Lane 3 represents further PCR amplification of PCR 
product as represented in fig 1 lane 3. Lane 4 represents further PCR amplification 
20 of PCR product as represented in Fig. 1 B lane 4. Lane 5 represents further PCR 
amplification of PCR product as represented in Fig. 1B lane 5. Lane 6 represents 
further PCR amplification of PCR product as represented in Fig. 1 B lane 6. Lane 7 
represents further PCR amplification of PCR product as represented in Fig. 1 lane 7. 
Lane 8 represents further PCR amplification of PCR product as represented in Fig. 
25 1Blane8. 

Lane 9 of Fig. 1B represents PCR amplification of water (Blank). Lane 10 represents 
PCR amplification of the input into well 2 of Fig. 1B. Lane 1 1 represents PCR ampli- 
fication of the input into well 3 of Fig. 1 B. Lane 1 2 represents PCR amplification of 
30 the input into well 4 of Fig. 1B. Lane 1 3 represents PCR amplification of the input 
into well 5 of Fig. 1B. Lane 14 represents PCR amplification of the input into well 6 
of Fig. 1 B. Lane 15 represents PCR amplification of the input into well 7 of Fig. 1 B. 
Lane 16 represents PCR amplification of the input into well 8 of Fig. 1B. Lane 17 
represents PCR amplification of water (Blank). 

35 
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From Fig. 1A lane 6 it can be inferred that the double stranded DNA complex T1+B 
was enriched more than ten thousand fold over the T2-B complex. Further enrich- 
ment'can not be deduced from Fig. 1 A since the number of identifiers recovered and 
analyzed by PCR in lane 7 and 8 where to few to produce detectable bands after 30 
5 rounds of PCR. When subjecting the PCR products shown in Fig. 1B to an addi- 
tional 30 rounds of PCR the T2 product, in addition to the T1 product, appears in 
Fig. IB lanes 5, 6, 7 and 8. The T2 product becomes the dominating PCR product in 
lane 8 suggesting that enrichment of the T1 +B complex over the T2-B complex is 
between one hundred thousand and one million fold. 

10 

Example 2: Enrichment of di-nitro-phenoi tagged DNA identifier. 
100 pico mol Of a tfNA identifier, T1 (GAGCGGATGCGTACATCTTGTACATGTCA- 
ATGCGATCGATCGACTGATCAAGATAGACCGAGTGCCG) was annealed to 50 pico 
mol of a di-nitrophenol (DNP) tagged primer F1 -PC-DNP (DNP-PC- 

15 CGGCACTCGGTCTATCTT). PC is a photo cleavable group obtainable as a phos- 
phoramidite from Glen Research (Products cat# 10-4913). Subsequently the F1-PC- 
: DNP primer was extended by DNA polymerase (Sequenase from Upstate Biotech- 
nology Cat# 70775Y). Likewise 50 pico mol of a DNA Identifier T2 
(GAGCGGATGCGTACATCTACGATGGATGCTCCAGGTCGCAAGATAGAC- 

20 CGAGTGCCG) was annealed to 1 00 pico mol of a primer F1 

(CGGCACTCGGTCTATCTT) and extended by DNA polymerase. This generated 
two pools of double stranded DNA sequences each containing 50 pico mol of the 
following complexes, one of length 67 nucleotides termed T1+DNP and one of 
length 57 nucleotides termed T2-DNP. 

25 

Enrichment of the DNP tagged T1+DNP complex was performed by coating 1 jig 
anti DNP antibody in Nunc Immunomodule U8 Maxisorp (Biotecline cat # hun- 
47507). The two pools of double stranded DNA, T1 +DNP and T2-DNP where mixed 
in ratios 1 pmol of T1 +DNP complex and 1 pmol of T2-DNP complex, 1/1 0 pmol of 

30 T1 + DNP complex and 1 pmol of T2-DNP complex, 1/1 00 pmol of T1+DNP complex 
and 1 pmol of T2-DNP complex, 1/1 000 pmol of T1 +DNP complex and 1 pmol of 
T2-DNP complex, 1/10000 pmol of T1+DNP complex and 1 pmol of T2-DNP com- 
plex, 1/100000 pmol of T1+DNP complex and 1 pmol of T2-ONP complex and 
1/1 000000 pmol of T1 + DNP complex and 1 pmol of T2-DNP complex. All where 

35 mixed in 100 ui buffer A (Tris buffered saline, 0.05% Tween 20, 1 % Bovine serum 
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albumin, 0.1 mg/mL herring sperm DNA}. Incubation in anti-DNP antibody coated 
wells was done for 30 min at 25*C. After Hgand binding all wells were washed 30 
times with 250 \iL buffer A during one hour. Thereafter 100 \iL buffer A was applied 
to each well and the wells where exposed to UV light at 350 nano meters for 30 
.5 seconds in order to deave the PC spacer thereby releasing the T1 identifier from the 
DNP molecule. Following exposure to UV light the elution volume was removed im- 
mediately and analysis for the presence of DNA strands T1 and T2 by polymerase 
chain reaction (PCR). 

1 0 The content of T1 and T2 following enrichment for DNP containing molecules was 
analysed by generating M P phosphate labeled PCR products of T1 and T2. Both T1 
and T2 PCR products where generated by the use of the two primers F1 
(CGGCACTCGGTCTATCTT) and R1 (GAGCGGATGCGTACATCT). R1 was la- 
beled with ^P phosphate by the use of T4 polynucleotide kinase from Promega 
1 5 (Cat# M41 01 ). PCR was performed by adding 10 ul of the eluted volume together 
with 15 ul of water containing 1 pico mole of both F1 and ^P-labeled R1 to puRe- 
Taq™ Ready-To-Go™ PCR Beads (Amersham Biosciences, Cat# 40751 3-96). PGR 
was performed for 30 cycles by annealing at 58*C and extending at 72'C. The PCR 
products where diluted 100 fold and subjected to an additional 30 rounds of PCR 
amplification as described above. PCR generated products where separated by 
urea containing polyacrylamide gel electrophoresis and visualized by radiography. 

Eight wells coated with rabbit anti DNP antibody, 1 jig/well (DAKOCytomation # 
V0401) were used for the enrichment experiment shown in Fig. 2. Lane 1 in fig 2. 
represents PCR amplification of elutate from a well that was incubated with Buffer A 
without any identifier complexes. Lane 2 is PCR amplification of identifiers eluted 
from a well that was incubated with Buffer A containing 1 pmol of T1+DNP complex 
and 1 pmol of T2-DNP complex. Lane 3 is 1/10 pmol of T1+ DNP complex and 1 
pmol of T2-DNP complex. Lane 4 is 1/1 00 pmol of T1 +DNP complex and 1 pmol of 
T2-DNP complex. Lane 5 is 1/1000 pmol of T1+DNP complex and 1 pmol of T2- 
DNP complex. Lane 6 is 1/1 0000 pmol of T1 +DNP complex and 1 pmol of T2-DNP 
complex. Lane 7 is 1/100000 pmol of T1+DNP complex and 1 pmol of T2-DNP 
complex. Lane 8 is 1/1000000 pmol of T1+DNP complex and 1 pmol of T2-DNP 
complex. Lane 9 of fig 1 . represents PCR product of water for control. Lane 1 0 rep- 
resents PCR amplification of the input into the well that was subjected to selection 
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and analyzed by PCR in lane 2 of fig 2. Lane 1 1 represents PCR amplification of the 
input into the well analyzed in lane 3 of fig 2. Lane 12 represents PCR amplification 
of the input into the well analyzed in lane 4 of fig. 2. Lane 13 represents PCR ampli- 
fication of the input into the well analyzed m lane 5 of fig. 1. Lane 14 represents 
5 PCR amplification of the input into the well analyzed in Jane 6. of Fig. 2. Lane 1 5 rep- 
resents PCR amplification of the input into the well analyzed in lane 7 of Fig 2. Lane 
16 represents PCR amplification of the input into the well analyzed in lane 8 of Fig 2. 
Lane 1 7 represents PCR amplification of water for control. 

10 • From Fig. 2 lane 8 it can be concluded that the double stranded DNA complex 
T1+DNP was enriched more than one million fold over the T2-DNP complex. A 
weak T2-DNP band is appearing in lane 8 suggesting that enrichment of T1+DNP is 
considerably higher than one million fold. Products from both T1 and T2 in lane 1 
appear despite no input of T1 +DNP or T2-DNP complexes in this lane. This repre- 

15 sent cross contamination from handling; of identifiers resulting in weak product 

bands after two by 30 cycles of PCR. PCR product bands from T1 disappear in input 
lanes 13 to 16 since the T2 identifier is much in excess and take over the PCR reac- 
tion utilizing the 1 pico mole of both F1 and ^P-labeled R1 primers. This is also 
seen in the bottom of the gel where the ^P-labeled R1 primer is disappearing. 



Example 3: Enrichment of di-nitro-phenol tagged DNA identifier in the presence of a 
biotin tagged identifier. (A library of two displayed molecules! 
100 pico mol of a DNA identifier, T1 (GAGCGGATGCGTACATCTTGTACATGTCA- 
25 ATGCGATCGATCGACTGATCAAGATAGACCGAGTGCCG) was annealed to 50 pico 

mol of a biotin tagged primer F1-PC-B (Biotin-PC-CGGCACTCGGTCTATCTT). PC is 
a photo cleavable group obtainable as a phosphoramidite from Glen Research 
(Products cat# 10-4913). Subsequently the F1-PC-B primer was extended by DNA 
polymerase. (Sequenase from Upstate Biotechnology Cat# 70775Y). 

30 

Likewise, 100 pmol T2 (GAGCGGATGCGTACATCTACGATGGATGCTCCAG- 
GTCGCAAGATAGACCGAGTGCCG) was annealed to 50 pico mol of a di- 
nitrophenol (DNP) tagged primer F1-PC-DNP (DNP-PC-CGGCACTCGGTCTATCTT). 
Subsequently, the F1-PC-DNP primer was extended by DNA polymerase as de- 
35 scribed above. 
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This generated two pools of double stranded DNA sequences respectively contain- 
ing 50 pico mol of the following complexes, one of length 67 nucleotides termed 
T1+B and one of length 57 nucleotides termed T2+DNP. 

5 

Enrichment of the DNP tagged T2+DNP complex was performed by coating 1 
jig/well anti DNP antibody (DAKOCytomation # V0401 ) in Nunc Immunomodule U8 
Maxisorp (Biotecline cat # nun-47507). The two pools of double stranded DNA, 
T2+DNP and T1+B where mixed in ratios 1 pmol of T2+DNP complex and 1 pmol of 
10 T1+B complex, 1/10 pmol of T2+DNP complex and 1 pmol of T2+B complex, 1/100 
pmol of T2+DNP complex and 1 pmol of T1+B complex, 1/1000 pmol of T2+DNP 
complex and 1 pmol of T1+B complex, 1/10000 pmol of T2+DNP complex and 1 
pmol of T1+B complex, 1/100000 pmol of T2+DNP complex and 1 pmol of T1+B 
complex and 1/1000000 pmol of T2+DNP complex and 1 pmol of T1+B complex. All 
1 5 • where mixed in 100 ul buffer A (Tris buffered saline, 0.05% Tween 20, 1% Bovine 
serum albumin, 0.1 mg/mL herring sperm DNA). Incubation in anti-DNP antibody 
coated wells was done for 90 min at 25*C. After ligand binding all wells were 
washed 30 times with 250 \d. buffer A during one hour. Thereafter 100 ul buffer A 
was applied to each well and the wells where exposed to UV light at 350 nano me- 
0 ters for 30 seconds in order to cleave the PC spacer thereby releasing the T2 identi- 
fier from the DNP molecule. Following exposure to UV light the elution volume was 
removed immediately and analysis for the presence of DNA strands T1 and T2 by 
polymerase chain reaction (PCR). 

5 The content of T1 and T2 following enrichment for DNP containing molecules was 
analysed by generating M P phosphate labeled PCR products of T1 and T2. Both T1 
and T2 PCR products where generated by the use of the two primers F1 
(CGGCACTCGGTCTATCTT) and R1 (GAGCGGATGCGTAGATCT). R1 was la- 
beled with M P phosphate by the use of T4 polynucleotide kinase from Promega 
(Cat# M4101). PCR was performed by adding 10 \lL of the eluted volume together 
with 15 pi. of water containing 1 pico mole of both F1 and ^P-labeled R1 to puRe- 
Taq™ Ready-To-Go™ PCR Beads (Amersham Biosciences, CaW 40751 3-96). PGR 
was performed for 30 cycles by annealing at 58'C and extending at 72*C. PCR gen- 
erated products where separated by urea containing polyacrylamide gel electropho- 
resis and visualized by radiography. 
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Eight wells coated with rabbit anti DNP antibody. 1 jig/well were used for the en- 
richment experiment shown in Fig. 3: Lane 1 in fig 3. represents PCR amplification 
of elutate from a well that was incubated with Buffer A without any identifier com- 
5 plexes. Lane 2 is PCR amplification of identifiers eluted from a well that was incu- 
bated with Buffer A containing 1 pmol of T2+DNP complex and 1 pmol of T1+B 
complex. Lane 3 is 1/1 0 pmol of T2+DNP complex and 1 pmol of T1 +B complex. 
Lane 4 is 1/100 pmol of T2+DNP complex and i pmol of T1+B complex. Lane 5 is 
1/1000 pmol of T2+DNP complex and 1 pmol of T1+B complex. Lane 6 is 1/10000 
10 * pmol of T2+DNP complex and 1 pmol of T1+B complex. Lane 7 is 1/100000 pmol of 
T2+DNP complex and 1 pmol of T1+B complex. Lane 8 is 1/1000000 pmol of 
T2+DNP complex and 1 pmol of T1+B complex. Lane 9 of fig. 1. represents PGR 
product of water for control. Lane 10 represents PCR amplification of the input into 
the well that was subjected to selection and analyzed by PCR in lane 2 of Fig. 3. 
15 Lane 1 1 represents PCR amplification of the input into the well analyzed in lane 3 of 
Fig. 1. Lane 12 represents PCR amplification of the input into the well analyzed in 
lane 4 of Fig. 3. Lane 13 represents PCR amplification of the input into the well 
analyzed in lane 5 of Fig 3. Lane 14 represents PCR amplification of the input into 
the well analyzed in lane 6 of Fig 3. Lane 15 represents PCR amplification of the 
10 input into the well analyzed in lane 7 of Fig 3. Lane 16 represents PCR amplification 
of the input into the well analyzed in lane 8 of Fig. 3. Lane 17 represents PCR am- 
plification of water for control. 

From Fig. 3 lane 5 it can be concluded that the double stranded DNA complex 
!5 T2+DNP was enriched more than one thousand fold over the T2-B complex. Only 
weak T2-DNP bands are appearing in lane 3-5 suggesting that enrichment of 
T2+DNP is considerably higher than one thousand fold. Further fold enrichment can 
not be concluded from this experiment. Additional rounds of PCR could possibly 
reveal the ratio between T1 and T2 in lanes 6-8. PCR product bands from T2 disap- 
0 pear in input lanes 1 3 to 1 6 since the T1 identifier is much in excess and take over 
the PCR reaction utilizing the 1 pico mole of both F1 and ^P-labeled R1 primers. 
This is also seen in the bottom of the gel where the ^P-labeled R1 primer is disap- 
pearing. 
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Example 4: Selection of an inteorin oV33 liaarid from a 484-member small molecule 
library encoded bv chemetics™. 



5 Overview of the procedure 




Annealing/transfer/extension 




10 



15 



20 



DF : Drug fragment / chemical entity 
B: Biotin 
SA: Streptavidin 

The method for producing a library of bifunctional complexes, in which each member 
of the library comprises a synthetic molecule and an identifier that may be decoded 
to establish the synthetic history of the synthetic molecule comprises several steps, 
exemplified below. In a first step (General procedure 1), four different identifier oli- 
gonucleotides are loaded with a scaffold molecule or drug fragment In this example 
the loading is conducted using an amino group on the identifier oligo as the attach- 
ment point for the drug fragment/ scaffold molecule. The identifiers may be regarded 
as the nascent bifunctional complexes. ; 
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To prepare the building block oligos, identical carrier oligos are initially loaded with 
eleven different drug fragments using general procedure 2. The eleven loaded car- 
rier oligos are then ligated to anti-codon oligos of the first and the second round us- 
ing general procedure 3, thereby obtaining 1 1 building blocks for the first round and 
5 eleven building blocks for the second round. 

The library formation is described in detail in general procedure 4 and includes the 
mixing of the four different identifier oligos with the eleven different building blocks of 
the fist round. To bias the library one of the identifiers and one of the first round 

10 building blocks were added in an amount 100 below the amount of the other com- 
ponents. At conditions providing for annealing between the identifiers and the build- 
ing blocks, a cross-link between the scaffold molecules of the identifier oligo and the 
drug fragments were effected. The identifier oligos were then extended using a 
polymerase and using the anti-codon of the building block as the identifier. After the 

1 5 extension, the drug fragment is released from the building block by cleavage of a 
linkage between the drug fragment and the oligo. The spent building block oligo is 
removed by streptavidin beads. 

The second round includes the addition of building blocks to the nascent identifier- 
10 synthetic molecule complex obtained in the first round. To bias the library, one of the 
eleven second round building blocks was added in an amount 100 times below the 
amount used for the 10 other building blocks. The second round follows the same 
scheme as depicted above for the first round. The library formed is of 4 * 1 1 * 1 1 = 
484 members. One of the members, which is a known ligand for the target, appears 
5 only in a concentration of the library of one out of 3 * 1 0 8 Afunctional complexes. 

The library is then subjected to a selection process, as disclosed in general proce- 
dure 5. The selection involves addition of the library to wells coated with immobilized 
target After incubation of the library with the target, non-binding members of the 
0 library is removed by washing and a linkage between the synthetic molecule and the 
indentifier is cleaved. The cleaved off identifiers were collected and amplified by 
PCR. The amplified identifiers were decoded using general procedure 6. 
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General procedure 1: Loading of identifier ollgos 

10 pL triethanolamine (TEA) (0.1 M in DMF) was mixed with 10 pL Building Block 
(8B) with Pent-4-enai'as an amine protection group (0.1 M in DMSO). From this 
mixture 6.7 pL was taken and mixed with 3.3 pL EDC {1-Ethyl-3-(3- 
5 Dimethylaminopropyl) carbodiimide Hydrochloride ] (0.1 M in DMF) and incubated 
30 minutes at 25°C. 1 0 pL of the Building block-EDC-TEA mixture was added to 1 0 
pL of amino oligo in 0.1 M HEPES buffer ((4-(2-Hydroxyethyl)-1- 
piperazineethanesulfonic acid, SIGMA), pH 7.5 and incubated with the oligo for 30 
minutes. 

10 

During this half hour, another 6.7 pL of BB-TEA mix was mixed with3.3 pL EDC (0.1 
M in DMF) and incubate for 30 minutes at 25°C. 10 pL of this second BB-EDC-TEA 
mixture was then added to the amino oligo mixture together with 1 0 pL of 0. 1 M 
HEPES buffer to maintain a 1:1 ratio of DMSO/DMF: H 2 0. Then the mixture was 
15 incubated for 30 minutes. 

During this half hour, another 6.7 pL of BB-TEA mix was mixed with3.3 pL EDC (0.1 
Min DMF) and incubate for 30 minutes at 25°C. 10pLofthis third BB-EDC-TEA 
mixture was then added to the amino oligo mixture together with 10 pL of 0.1 M 
20 HEPES buffer to maintain a 1:1 ratio of DMSO/DMF: H2O. Then the mixture was 
incubated for 30 minutes. 

The loaded oligo was then purified by gel filtration with columns (Biospin P-6, Bio- 
Rad) equilibrated with water. The pent-4-enal amine protection group was then re- 
25 moved by addition of 0.25 volumes 25 mM l 2 in 1:1 watertetrahydrofuran (THF) and 
incubation at 37°C for 2 hours. The mixture was then purified by gel filtration with 
spin columns (Biospin P-6, BioRad) equilibrated with water. Loaded identifier oligos 
were analyzed by ES-MS. 



c 
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Example 4.1.1 

Identifier oligo 1.1: 5'- 

NSPACCTCAGCTGTGTATCGAGCGGCAGCGTTATCGTCG-3' | 



N : 5-Amino-Modifier 5 (Glen research cat# 10-1905-90) 
S : Spacer C3 CPG (Glen research cat# 20-2913-01) 
P : PC Spacer Phosphoramidite (Glen research cat# 10-4913-90) 



Sequence identifying the 
loaded fragment 



c 



10 



' o Ac' I | 

NH 2 OH HN HN"Y 
| _ _ -J HN^O " — 1 NH 2 



EDC / TEA 



Loaded identifier oligo 1.1 



Loaded identifier oligo 1.1 analyzed by ES-MS: 
1 5 Expected Mass : 1 1 709 Da 
Observed Mass : 11708 Da 



Example 4.1.2 

Identifier oligo 1.2: 5'- NSPACCTCAGCTGTGTATCGAGCGGCAGC AGTGC- 
20 CGTCG-3' 

N : S'-Amino-Modifier 5 (Glen research cat# 1 0-1 905-90) 

S : Spacer C3 CPG (Glen research cat# 20-291 3-01 ) 

P : PC Spacer Phosphoramidite (Glen research cat# 10-4913-90) 



WO 2005/003778 PCT/DK2004/000466 

56 




Loaded identifier oligo 1.2 analyzed by ES-MS: 
Expected Mass : 1 1647 Da 
5 * Observed Mass : 11641 Da 

Example 4.1.3 1 

Identifier oligo 1.3: 5'- NSPACCTCAGCTGTGTATCGAGCGGCAGCGCA2 
10 CACGTCG-3* 

N : S'-Amino-Modifier 5 (Glen research cat# 10-1905-90) 

S : Spacer C3 CPG (Glen research cat# 20-2913-01) 

P : PC Spacer Phosphoramidite (Glen research cat# 1 0-491 3-90) 

15 




Loaded identifier oligo 1.3 



20 



Loaded identifier oligo 1.2 analyzed by ES-MS: 
Expected Mass : 11761 Da ... 
Observed Mass : 1 1 759 Da 
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Example 4.1.4 

Identifier oligo 1 .4: 5 - NSPACCTCAGCTGTGTATCGAGCGGCAG CGGA- 
JACGTCG-3* 

5 

N : ff-Amino-Modifier 5 (Glen research cam 1 0-1 905-90) 

S : Spacer C3 CPG (Glen research cat# 20-2913-01) 

P : PC Spacer Phosphoramidite (Glen research cat# 10-4913-90) 

10 • Loaded identifier oligo: 
O 



15 



Expected Mass : 11775 Da 
Observed Mass : 11775 Da 



General procedure 2: Loading of carrier oligo 

1 0-1 5 nmol of carrier oligo 2 was lyophilized and redissolved in 27 5 pi H 2 Q. To this 
was added 7.5 pi 1 M HEPES pH 7.5, 1 0 pi of 2-amino-pent-4-enal protected (allyl- 
glycine) building block (0.1 M in dimethyl sulfoxide), and 5 pi DMT-MM [4-{4,6- 

20 dimethoxy-1 ,3,5-thiazin-2-y1)-4-methy1morpholinium chloride] (0.5 M in water). The 
mixture was incubated 4-1 6 hours at 25-30X. The oligo was purified by gel filtration 
(Biospin P-6, BioRad). To convert the methyl ester moiety of the building block to a 
carboxylic acid. 5 pi 0.4 M NaOH was added and the mixture was incubated 20 min 
at 80°C. The mixture was then neutralized by adding 10 pi 0.5 M HEPES pH 7.5 and 

25 5 pi 0.4 M HCI. The loaded building block oligo was purified by gel filtration (Biospin 
P-6, BioRad) and analyzed by ES-MS 

Carrier oligo 2: 3 , -2GGAGTCGACACATAGCTCGCp-5 , 



2: Carboxy dT (Glen research cat# 10-1035-90) 
p: 5' phosphate 
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AJJyl glycine building block 



1 



XfyC 
o 

DMT-MM 



Carrier oligo 2 



Loaded carrier oligo 2.1 analyzed by ES-MS: 
10 Expected Mass : 6856 Da 
Observed Mass : 6857 Da 



Loaded carrier oligo 2.1 



15 



10 



Example 4.2.2 



J Ally! glycine building block 



EtOOC. 



•OH 



Carrier oligo 2 



DMT-MM o=^ 



Loaded carrier oligo 2.2 analyzed by ES-MS: 
Expected Mass : 6944 Da 
Observed Mass : 6945 Da 



HOOC n — v 




Loaded carrier oligo 2.2 
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Example 4.2.3 



Allyl glycine building block 




Loaded carrier oligo 2.3 analyzed by ES-MS: 
Expected Mass : 6798 Da 
Observed Mass : 6800 Da 



Example 4.2.4 



Ally! glycine building block 




Loaded carrier oligo 2.4 

Loaded carrier oligo 2.4 analyzed by ES-MS: 

Expected Mass : 6917 Da 
Observed Mass : 6919 Da 
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Table! 



Carrier oligo 
Example 


Structure of loaded 
Carrier oligo 


Expected 
Mass 


Observed 
Mass 


4.2.5 


O 

^OHO-f 
HN HN-< /r^s, 

F 


6924 


6923 


4.2.6 


O 

<*° »°< 

NHHN"/ /r-jv 


6940 


6939 


4.2.7 


O 

>-OH 

— <° Vi*rvr\ 

HN— / 


6920 


6919 


4.2.8 


O 


6940 


6939 


4.2.9 


NH HN-< O 


6830 


6829 


4.2.10 

- 


— < 

NHO O 


5871 


S871 
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4.2.11 


0 


6920 


[6919 




V-OH 






— <°yt^c\ 
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General procedure 3: ligation of anti-codon oligo with loaded carrier oligo 

500 pmol loaded carrier oligo was mixed with 750 pmol anti-codon oligo and 750 
pmol splint oligo. The mixture was lyophilized and redissolved in 15 ul water. Oligos 
were annealed by heating and slowly cooling to 20°C. 15 ul TaKaRa ligase mixture 
(Takara Bio Inc) was added and the reaction was incubated at 20°C for 1 hour. The 
mixture was purified by gel filtration (Biospin P-6, BioRad) and the efficiency of the 
ligation was checked by running an aliquot on a Novex TBE-UREA gel (Invitrogen). 



15 



Examples of building block oligos for first round of encoding 
Example 4.3.1.1 




Anti-codon oligo 



3 ' - 2 GGAGTCGACACATAGCTCGCp CGTCGI 1 1 1 1 GCAGCCA- 
AT&GTCGT-X :::::::::::::: 

Splint oligo: TCGAGCG — GCAGCCA 



20 




Ugase 



3 ' -2GGAGTCGACACATAGCTCGCCGTCGII 1 1 IG- 
CAG CCAATA GTCGT-X 



2: Carboxy dT (Glen research cat# 10-1035790) 



Building block 
oligo 3.1.1 
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P: 5' phosphate 
X 5" biotin 

Efficiency of ligation : > 95 % 
Example 4.3.1.2 



HOOC. _ 



3 ' -2GGAGTCGACACATAGCTCGCCGTCGIIIIIGCAGCCGTGTGTCGT-X 

Efficiency of ligation : > 95 % 



Example 4.3.1.3 




H ? H 



HN Y ° 
O 



3 ' - 2GGAGTCGACACATAGCTCGCCGTCGI 1 1 1 IGCAGCTCACGGTCGT-X 
Efficiency of ligation : > 95 % 
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Table II 



Building 

block 

oligo 

example 


Structure of loaded 
Drug fragment 


Building block oligo sequence 
. 2: Carboxy dT (Glen researchcatf 1 0- 
1035-90) 
X:5'biotin 


Ligation 
effi- 
ciency 


4,3.1.4 


JW>. 

0=( / r-COOH 
HN — ( 


-jTz — ~ 

2GGAGTCGACACATAGCTCCCCGTCGIIIIIGCAGCCCTA 
1GTCGT-X 


>95% 


; 4.3.1.5 


q HO— 1 
HN HNH 


F 


3'- 

2GGAGTCGACACATAGCTCGCCGTCC 1 1 1 1 IGCAGCGCGA 
I £GTCGT— X 


>95% 


4.3.1.6 




0 


— : ' - 

3 GGAGTCGACACATAGCTCGCCGTCG 1 1 1 1 IGCAGCGACC 
A/3TCGT-X 


>95% 


4.3,1.7 


c 

HN— 




3'- 

3GGAGTCGACACATAGCTCGCCGTCGIIIIIGCAGCACAA 
SCTCGT-X 


>95% 


4.3.1.8 


/> * 

NH HN 




2GGAGTCGACACATAGCTCGCCGTCGI III IGCAGCTGGA 
2GTCGT-X 


>95% 


4.3.1.9 


NH HN-( P ! 


rr ■ — -- 

>GGAGTCGACACATAGCTCGCCGTCGIIIIIGCAGCGCTC 
jGTCGT-X 


>95% 
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14.3.1.10 


C 0 


3'- 

3GCMGTCC»CACATAGCTCGCCCTOTIIIIIGCAGC^^ 
flGTCGT-X . 


>95% 


4.3.1.11 




■J7Z — — 

2GGAGTCGACACATAGCTCGCCGTCGI III IGCAGCCCGG 
AGTCGT-X 


>95% 



Examples of building block oligos for second round of encoding 
Example 4.3.2.1 
Building block oligo 3.2.1 : 




0 ly^ 



3 ' -2GGAGTCGACACATAGCTCGCCGTCG 1 1 1 1 IGCAGCI 1 1 1 IGTCGTCAATA- 
CAGCTTAGACGGTAGATTTX 



Efficiency of ligation : > 95 % 
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Example 4.3.2.2 




3 ' - 2 GGAGTCG ACACATAGCTCGCCGTCGI 1 1 IIGCAGCI 1 1 1 IGTCG TCGTGT CAG - 
- CTTAGACGGTAGATTTX 

Efficiency of ligation : > 95 % 



Example 4.3.2.3 




. 3 ' -2GGAGTCGACACATAGCTCGCCGTCGIIIIIGCAGCII I IIGTCG TTCACG- 
C AGCTTAGA -CGGTAGATTTX 

Efficiency of ligation : > 95 % 
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Table III 



Building 
block ol'tgc 

' example 


Structure of 
> loaded 

Drug fragment 


Building block oligo sequence 

2: Carboxy dT (Glen research cat# 10-1035-90) 

X: 5* biotin 


Ligation 
effi- 
ciency 


4.3.2.4 


o=< r<x 

HN-< 

'1 


3'- 

2»3AGTCt^CACATAGCTCGCCGTCGIIIIIGCAGCIIIIIGTCGT 
> CCTATCAGCTTAGACGGTAGATTTX 


>95% 


4.3.2.5 


— >T< 

HN HN-< 




3GGAGTCGACACATAGCTCGCCGTCGIIIIIGCAGCIIIIIGTCGT 
GCGifiCAGCTTAGACtSGTAGATTTX 


>95% 


4.3.2.6 






3 ' - . . 
2Gt3AGTCGACACATAGCTCGC«STCGIIIIIGC^ 


>95% 




c 

NH HN- 


S£GG&CAGCTTAGACGGTAGATTTX 




4.3.2.7 




>- 


3'- 


>95% 




— 


^ 


A£A^A££ACCTTAGACGGTAGATTTX 




4.3.2.8 




3'- 

2GCAGTCt3K»CATA«rrCGCC^^ 
QSGj&CCAGCTTAGACGGTAGATTTX 


>95% 


4.3.2.9 


— <° • 

NH HN- i 


i'- 

«<1ACTCGACACATAGCTCGCC^ 
yTTCGCAGCTTAGACGGTAGATTTX 


>95% 
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4.3.2.10 


— < 


3 «• 

2GOUnxXACACMAGCTCGCCGra 
CATAGCAGCTTAGACGGTAGATTTX 


>95 % 


4.3.2.11 


\ 

.oo V 

— -< 


3 ' ~ 

2GGAGTCGACACJ^TAGCTCGCCGTOCIIIIIGCACCIIIIZGTC6T 
CCGGACAGCTTAGACGGTAGATTTX 


> 95 % 



General procedure 4: Encoding a small molecule library by chemetics™ 
Example 4.4.1: Encoding a 484-member small molecule library by chemetics 7 * 1 
Example 4.4.1.1 First encoding round 

2 pmol of loaded identifier oligo 1.1 was combined with 200 pmol of each loaded 
identifier oligo 1.2, 1.3, and 1.4. (602 pmol loaded identifier oligos in total). These 
were mixed with 0.7 pmol building block oligo 3.1. 3., and 72.7 pmol each of 10 dif- 
ferent other first round building block oligos (eg. 3.1.1 and 3.1.2; 727 pmol loaded 
building block oligos in total). The oligos were lyophilized and redissolved in 50 pi 
extension buffer (EX) [20 mM HEPES, 150 mM NaCI, 8 mM MgClJ. The mixture 
was heated to 80°C and slowly cooled to 20"C to allow efficient annealing of identi- 
fier and building block oligos. 5 pi of 0.5 M DMT-MM in water was added and the 
mixture was incubated at 37°C for 4 hours. 

Extension of the identifier oligo on the building block oligo identifier was performed 
by adding 3 pi of a 1 0 mM mixture of each deoxynucteotide triphosphate [dATP. 
dGTP, dCTP, dTTP] and 3 pi of 13 units/ul Sequenase (Amersham Biosciences). 
The mixture was subsequently incubated at 30°C overnight. Then 3 pi of 2M NaOH 
was added and the mixture was incubated for80°C for 10 minutes followed by neu- 
tralization by addition of 3 pi 2M HCI. The mixture was then purified by passing 
. through a gel filtration column (Biospin P-6, BioRad). 0.25 volumes of 25 mM \ 2 in 
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1:1 THF: water was added, mixed and incubated at 37°C for 2 hours. 60 ul binding 
buffer (BF) (1 00 mM HEPES, 1 50 mM NaCO and water ad 300 pi was added. 

' The mixture was added to streptavidin-sepharose beads (Amersham Biosciences) 
5 pre-washed 3 times in BF buffer and incubated at room temperature for 10 minutes 
followed by incubation on ice for 10 minutes with gentle stirring. The beads were 
then washed three times with water. Extended identifier oligos were stripped from 
the building block oligos bound to the streptaviding-sepharose beads by applying 
100 pi NH3 1:1 in water and incubating at room temperature for 5 minutes. 

4.4.1.2 Second encoding round 

To the eluate was added 0.36 pmol second round loaded building btock oligo 3.2.2 
and 36.4 pmol each of 10 different other second round building block oligos (eg. 
3.2.1 and 3.2.3; 364 pmol loaded second round building block oligos in total) and the 
mixture was lyophilized and redissolved in 50 pi EX buffer. The encoding was per- 
formed essentially as described under 4.1.1. 

4.4.1.3 Final extension 

The eluted identifier oligo were tyophilized and dissolved in 50 pi EX buffer. Then 
200 pmol primer E38 [5'-XTTTTAGATG GC AGAT-3', X=CXS Biotin] was added. An- 
nealing was performed by heating the mixture to 80°C and slowly cooling to 20°C. 
Extension of the identifier oligo was performed by adding 3 pi of a 10 mM mixture of 
each deoxynucleolide triphosphate [dATP, dGTP, dCTP, dTTP] and 3 ul of 13 
units/ul Sequenase. The mixture was subsequently incubated at 30°C for 2 hours. 
The mixture was then purified by passing through a gel filtration column (Biospin P- 
6, BioRad). This eluated was used for selection. An aliquot (sample 4.1.3) was re- 
moved for analysis of the inpout in the selection procedure. 

General procedure 5: selection 

Maxisorp ELISA wells (NUNC A/S t Denmark) were coated with each 100 pL 2ug/mL 
integrin aVJ33 (Bachem) in PBS buffer [2.8 mM NaHaPO^ 7.2 mM Na 2 HPO Ai 0.15 M 
NaCI, pH 7.2] overnight at 4°C. Then the integrin solution was substituted for 200 pi 
blocking buffer [TBS, 0.05% Tween 20 (Sigma P-9416), 1% bovine serum albumin 
(Sigma A-7030), 1 mM MnCIJ which was left on for 3 hours at room temperature. 
Then the wells were" washed 10 times with blocking buffer and the. encoded library 
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was added to the wells after diluting it 1 00 times with blocking buffer. Following 2 
hours incubation at room temperature the wells were washed 10 times with blocking 
buffer. After the final wash the wells were cleared of wash buffer and subsequently 
inverted and exposed to UV light at 300-350 nm for 30 seconds. Then 1 00 pi block- 
5 ing buffer without Tween-20 was immediately added to each well, the wells were 

shaken for 30 seconds, and the solutions containing eluted identifiers were removed 

for PCR analysis (sample 5. 1) 

General procedure 6: analysis of selection Input and output 

10 PCR was performed on the input for (sample 4.3.1) and output of (sample 5.1) the 
selection using primers corresponding to the 5' end of the identifier oligos and the 
E38 primer. PCR was performed using Ready-To-Go (RTG) PCR beads (Amersham 
Biosciences) and 10 pmol each primer in a reaction volume of 25 pi. The PCR reac- 
tion consisted of an initial denaturation step of 94 °C for 2 minutes followed by 30-45 

15 cycles of denaturation at 94°C for 30 seconds, annealing at 58°C for 1 minute and 
extension at 72°C for 1 minute. A final extension step of 2 minutes at 72°C was in- 
cluded. The PCR products were resolved by agarose gel electrophoresis and the 
band corresponding to the expected size was cut from the gel and purified using 
QIAquick Gel Extraction Kit (QIAGEN). 

20 

To sequence individual PCR fragments the purified PCR products were cloned into 
the pCR4-TOPO vector (Invitrogen) according to the manufacturer's instructions. 
The resulting mixture was used for transformation of TOP10 £. coti cells (Invitrogen) 
using standard procedures. The cells were plated on growth medium containing 100 
25 pg/ml ampicillin and left at 37°C for 12-16 hours. 

Individual £co//dones were picked and transferred to PCR wells containg 50 pi 
water. These wells were then boiled for 5 minutes and 20 pi mixture from each well 
was used in a PCR reaction using RTG PCR beads and 5 pmol each of M1 3 for- 
30 ward and reverse primers according to the manufacturer's instructions. A sample of 
each PCR product was then treated with Exonuclease I (USB) and Shrimp Alkaline 
Phosphatase (USB) to remove degrade single stranded DNA and dNTPs and se- 
quenced using the DYEnamic ET cycle sequencing kit (Amersham Biosciences) 
according to the manufacturer's instructions and the reactions were analyzed on a 
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MegaBace 4000 capillary sequencer (Amersham Biosciences). Sequence outputs 
were analyzed with Cont/gExpress software (Informax Inc.). 
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Overview of drug fragments present In the library: 
Table IV 



Identifier 


Building block oligo for first 
round 


Building block oligo for second 
round 


Oligo 


Rela- 
tive 

amount 
in li- 
brary 


Structure 
of drug 
fragment 


Oligo 


Rela- 
tive 

amount 
in li- 
brary 


Structure of 
transferred 
drug frag- 
ment 


Oligo 


Rela- 
tive 

amount 
in li- 
brary 


Structure of 
transferred 
drug fragment 


•1.1 


100 


V 

O' 


3.1.1 


1 


o 


3.2.1 


100 


0 


1.2 


1 


NH 

cAd»? 


3.1.2 


100 


NH 

<A>.? 


3.2.2 


100 


NH 


1 .3 


inn 


o 

% 


O. I .o 


inn 

1UU 


\ N 
O 

o 


3.2.3 


1 


O 

o 


1.4 


100 


o 


3.1.4 


100 




3.2.4 


100 


o 








3.1.5 


100 

• 


F 


3.2.5 


100 


F 








3.1.6 


100 




3.2.6 


100 


4^ 
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3.1.7 


100 


V 


3.2.7 


100 


V 








3.1.8 


100 




3.2.8 


100 










3.1.9 


100 


HN — ( 


3.2.9 


100 . 


HN — ( % ^3 








3.1.10 


100 


-< 


3.2.10 


100 


p 








3.1.11 


100 

\ 


\ 


3.2.11 


100 

{ 


i 



The library had the potential to encode the integrin oVp3 ligand A (Molecule 7 in 
Feuston B. P. et al., Journal of Medicinal Chemistry 2002, 45, 5640-5648) from 1 
of 3*10 8 identifiers. 



^^^^ 



As can be seen from the table above, the library had the potential to encode ligand 
A for every 3'lu 8 identifiers (1 x 1 x 1 = 1 out of every 301 x 1001 x 1001 - 3*10') 
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Example 4.6.1: Result of sequencing analysis of Input for selection procedure 
and output from selection procedure. 

The codon combination compatible with encoding of ligand A was not found in 28 
sequences derived from the encoded library before selection in agreement with the 
expected low abundance of this codon combination (1 in 3*1 0 a ). 

A codon combination compatible with encoding of ligand A was found in 5 out of 19 
sequences derived from the encoded library after selection in integrin aV£3-coated 
10 wells. 

These numbers correspond to an enrichment factor of (3*1 0 a / (19 / 7)) = 8*1 0 7 . 
Example 5: Selection of encoded molecules using size-exclusion column 

15 

This example illustrates the possibility to use column separation to perform selection 
on complexes against various targets. In this example, size-exclusion chromatogra- 
phy (SEC) is used, but other types of chromatography can be used where target- 
bound complexes are separated from the non-bound complexes. 

The complex is exemplified in this example by a biotin molecule attached to an oli- 
gonucleotide sequence with a predetermined sequence. Thus, the nucleotide se- 
quence of the identifier specifies the identity of the synthetic molecule as biotin. The 
encoding sequence can have any length and be divided into discrete regions for 
encoding various building blocks as discussed elsewhere herein. Also, the displayed 
molecule can have a linear or scaffold structure. 

Biotin-AATTCCGGAACATACTAGTCAACATGA 

Biotin is known to bind to streptavidin. The binding of biotin to streptavidin will link 
the identifier to the target molecule and therefore change the identifiers physical and 
chemical properties, such as e.g. the apparent molecular weight This change is 
possible to detect using e.g. size-exclusion chromatography: 
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78 pmol of the complex molecule was loaded on a Superdex 200, PC 3.2/30 column 
(AKTA-FPLC, AmershamPharmaciaBiotech) and analysed in PBS buffer with a flow 
rate of 0.050 ml/min. As can be seen below, the complex molecules retention-time 
was approximately 35 minutes. When the target (83 pmol streptavidin) was analysed 
5 under identical conditions the retention-time was approximately the same. The low 
absorption of the target molecules is due to the wavelength (260 nm) used in the 
measurement. At this wavelength, the extinction coefficient is high for the nucleo- 
tides in the complexes but low for the protein target. 



10 




However, when the complex molecules was premixed with the target molecules (78 
pmol complex and 83 pmol target incubated for about 1 h in PBS buffer) "to allow 
binding and then analysed under identical conditions, the retention-time change sig- 

1 5 nificantly (28 minutes). The change is due to the increase in molecular weight (or 

hydrodynamic volume) due to the binding of the complex to the target. This will allow 
the separation of the target-bound complexes from the non-bound complexes. The 
fraction that contains the complexes and the target molecules are pooled and ampli- 
fied using appropriate primers. The amplified identifiers can then be used to decode 

20 the structures of the enriched displayed molecules. 

The strategy of performing column-selection of libraries of Afunctional complexes 
has two major advantages. First, the enriched (target-bound) complexes are eluted 
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before the non-bound complexes, which will drastically reduce the background from 
the non-bounded complexes. Secondly, the enrichment on the column will be exten- 
sive due to all the separation steps in the pores in the matrix 

5 The separation of the target-bound complexes using this approach will be depend- 
ent on the molecular weight of the complexes but predominantly of the molecular 
weight of the target The molecular weight of the target can be adjusted by linking 
the target to a support that increases the apparent molecular weight. The increased 
molecular weight will enhance the separation by reducing the retention-time on the 
1 0 ' column. This can be done using for example a fusion protein, antibody, beads, or 

cross-linking the target in multimeric form. Thus, the target protein can be expressed 
as a fusion protein 'or a specific antibody can be use to increase the molecular 
weight The target can be immobilized on small beads that permit separation and 
the target can be cross-linked using standard reagents to form multimers or cross- 
linked to a earner molecule, for example another protein. Preferably, the molecular 
weight is increase so the target molecules elute in the void volume of the column. 

Examples of other types of column separation that can be used are affinity chroma- 
tography, hydrophobic interaction chromatography (HIC), and ion-exchange chro- 
matography. Examples of column media, other that Superdex, that can be used in 
size-exclusion chromatography are: Sephacryl, Sepharose or Sephadex 
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Claims 

1. A method for identifying a synthetic molecule having affinity towards a target, 
comprising the steps of • 

a) providing a library of afunctional complexes, wherein each complex 
of the library comprises a synthetic molecule attached to an identifier, 
which codes for said molecule, 

b) subjecting, under binding conditions, the library of Afunctional com- 
plexes to a target, 

c) removing the non-binding members of the library, 

d) separating the identifiers of complexes comprising synthetic mole- 
cules having affinity towards the target, and 

e) decoding the identifiers to establish the identity of the molecule. 

2. The method according to claim 1. wherein the synthetic molecule of the library is 
a non-a-polypeptide. 

3. The method of claim 1, wherein the synthetic molecule has a molecular weight 
less that 2000 Daltons, preferably less than 1000 Dalton. and more preferred 
less than 500 Daltons. 

4. The method according to claims 1 to 3. wherein the identifier uniquely identifies 
the synthetic molecule. 

5. The method according to any oftheclaiml to 5, wherein the identifier comprises 
a sequence of nucleotides. 

6. The method according to any of the preceding claims, in which the identifier 
comprises two or more codons coding for chemical entities which have partici- 
pated in the synthesis of the synthetic molecule. 

7. The method according to claim 6, in which each codoh comprises 4 or more nu- 
cleotides. 

8. The method according to claim 6, wherein the chemical entities are precursors 
for a structural unit appearing in the synthetic molecule. 

. 9. The method according to daim 6, wherein the chemical enfities are not naturally 
occurring a-amino acids or precursors therefore. 

10; The method according to any of the preceding claims, wherein the chemical en- 
tities are transferred to the nascent synthetic molecule by a chemical building 
block, which further comprises an anrj-codon. 

11. The method according to claim 10, wherein the information of the anfroodon is 
transferred in conjunction with the chemical entity to the nascent complex. 
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12. The method according to claims 10 or 11 in which the chemical entities am re- 
acted without enzymatic interaction. 

13. The method according to claim 6. wherein the codons are separated by a fram- 
ing sequence. 

5 , 14. The method according to any of the preceding claims, wherein the synthesis 
histoiy of the molecule is established by decoding the identifier. 

15. The method according to any of the preceding claims/wherein the synthetic 
molecule and the identifier are joined by a selectively cleavable linkage. 

16. The method according to claim 1. wherein the library comprises two or more 
) ' different complexes. 

17. The method according to claim 1. wherein library comprises 1.000 or more dif- 
ferent complexes. 

18. The method according to claim 1. wherein the library comprises 1.000.000 or 
more different complexes. 

19. The method according to claim 1. wherein the target is of biological origin 

20. The method according to claims 1 to 19. wherein the target is immobilized on a 
solid support 

21. Tbe method according to any of the claims 1 to 20. in which a cleavable linkage 
between the target and the solid support is present 

22. The method according to any of the claims, wherein the separation of identifiers 
of complexes comprising synthetic molecules having affinity towards the target 
involves the cleavage of one or more linkages. 

23. The method of claims 1 5 or 21 . wherein the linkage is selectively cleavable us- 
ing electromagnetic radiation, a chemical agent, or an enzyme. 

24. The method according to any of the preceding claims, wherein the cleavable 

linkage comprises a group NO, , R 2 in which R', R J , and R 3 inde- 

pendently are the synthetic molecule, the identifier, or a group H or OCH 3 re- 
spectively. provided R* is either the synthetic molecule or the identifier; and X is 
selected from the group comprising O, S, or NH. 
25. The method according to any of the preceding claims, wherein the removing of 

the non-binding members of the library in step c) or the separation of step d) in- 

eludes chromatography. 
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26. The method of any of the preceding claims, wherein the cleavage of a linkage 
joining the synthetic molecule and the identifier is preceded by chromatography 
to separate the identifier; 

27. The method according to claims 25 or 26, wherein the chromatography is size- 
exclusion chromatography. 

28. A library of complexes, in which each different complex comprises a synthetic 
molecule, attached via a cleavable linkage to an identifier which codes for said 
molecule. 

29. A library according to claim 28, wherein the selective cleavable linkage com- 
prises a chemical moiety, which can be deaved by electromagnetic irradiation. 

30. A library according to claim 28 or 29. wherein the cleavable linkage comprises a 

group N02 r2 m wnicn R i f R 2 and r3 independent]y are ^ ^ 

thetic molecule, the identifier, or a group H or OCH 3 , respectively, provided R 2 is 
either the synthetic molecule or the identifier; and X is selected from the group 
15 comprising O, S, or NH. 

31, A library according to any of the claims 28 to 30, wherein the identifier comprises 
a sequence of nucleotides. 

32. A library according to any of the claims 28 to 31 , wherein the identifier comprises 
2 or more codons, which codes for 2 or more chemical entities incorporated into 

20 the synthetic molecule. 
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